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TTMTfi TTR ^SSOrTATED KAPOST 'c Q&T?PQMA VTPTTS SEQUENCES AND 
TTggS THEREOF 



The invention disclosed herein was made witr. 
Government support under a co-operative agreement 
CCU210eS2 from the Centers for Disease Control ana- 
Prevention, of the Department of Health and Human 
Services- Accordingly, the U.S. Government has 
certain rights in this invention. 

This application is a continuation-in-part application 
of U.S. Serial No. 08/420,235, filed on April 11, 1995 
which is a continuation-in-part application of U.S. 
Serial No. 08/343,101, filed on November 21, 1994, a 

continuation-in-part application of U.S. Serial No. 

OB/292,365, filed on August 18, 1994, which is hereby 

incorporated by reference. 

Throughout this application, various publications may 
be referenced by Arabic numerals in brackets. Full 
c _ atlons fcr t hese publications may be found at the 
end of each Experimental Details Section. The 

£ r,,^ 1 ^ - = one r -*-<=» d herein are m 
disclosures o: t ne puo_-^a_-o~ s t__w_^ 

their entirety hereby incorporated by reference into 
this application to more fully describe the state of 
the art to which this invention pertains. 

RACKGRQTTTJD OF T ^* TNVENTION 

Kaposi's sarcoma (KS) is the most common neoplasm 
occurring in persons with acquired immunodeficiency 
syndrome (AIDS). Approximately 15-20% cf AIDS 

patients develop this neoplasm which rarely occurs m 
immunocompetent' individuals [13, 14]. Epidemiologic 
evidence suggests that AIDS - associated KS (AIDS-KS) 
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has an infectious etiology. Gay and bisexual AIDS 
patients are approximately twenty times more likely 
than hemophiliac AIDS patients to develop KS , and KS 
may be associated with specific sexual practices among 
S gay men with AIDS [6, 15, 55, 83]. KS is uncommon 

among adult AIDS patients infected through 
heterosexual or parenteral HIV transmission, or among 
pediatric AIDS patients infected through vertical HIV 
transmission [77] . Agents previously suspected of 

10 causing KS include cytomegalovirus, hepatitis 3 virus, 

human papillomavirus, Epstein-Barr virus, human 
herpesvirus 6, human immunodeficiency virus (HIV) , and 
Mycoplasma penetrans [18, 23, 85, 91, 92]. Non- 
infectious environmental agents, such as nitrite- 

15 inhalants, also have been proposed to play a role in 

KS tumorigenesis [33] . Extensive investigations, 
however, have not demonstrated an etiologic 
association between any of these agents and AIDS-KS 
[37 , 44 , 46 , 90] . 
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SUMMARY O* 1 THE INVENTION 

This invention provides an isolated DNA molecule which 
is at least 30 nucleotides in length anc writer. 
5 uniquely defines a herpesvirus associated with 

Kaposi's sarcoma. This invention provides an isolated 
herpesvirus associated with Kaposi's sarcoma. 

This invention provides a method of vaccinating a 
L C subject for KS , prophylaxis diagnosing or treating a 

subject with KS and detecting expression cf a DNA 
virus associated with Kaposi's sarcoma in a cell. 
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ttPTFF DESCRIPTION OF THE FIGURES 
Figure Is 

Agarose gel electrophoresis of RDA produces trorr. 
AIDS-KS tissue and ur.involvea "issue. R.DA was 
performed on DNA extracted from KS skin tissue 
and uninvclved normal skin tissue obtained at 
autopsy from a homosexual man with AIDS-KS. lane 
1 shows the initial PGR amplified genomic 
representation of the AIDS-KS DNA after Bam HI 
digestion. Lanes 2-4 show that subsequent cycles 
of ligation, amplification, hybridization ana 
digestion of the RDA products resulted in 
amplification of discrete bands at 380, 450, 540 
and 680 bp. RDA of the extracted AIDS-KS DNA 
performed against itself resulted in a single 
band at 540 bp (lane 5) . Bands at 380 bp and 680 
bp correspond to KS330Bam and KS627Bam 
respectively after removal of 28 bp priming 
sequences. Bands at 4 50 and 54 0 bp hybridized 
nonspecif ically to both KS and non-KS human DNA. 
Lane M is a molecular weight marker. 

2 5 Figures 2A-2B: 

Hybridization of 32 P-labelled KS3 3 03am (Figure 2A) 
and KS6273am (Figure 23) sequences to a 
representative panel of 19 DNA samples extracted 
from KS lesions and digested with Bam HI. 
3C KS3303am hybridized to 11 of the 19 and KSS273am 

hybridized to 12 of the 19 DNA samples from AIDS- 
KS lesions. Two additional cases (lanes 12 and 
13) were shown to have faint bands with both 
KS3303am and KS627Bam probes after longer 

3 5 exposure. One negative specimen (lane 3) did not 

have microscopically detectable KS in the tissue 
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specimen- Seven of 8 additional KS DNA samples 
also hybridized to both sequences. 

Figure s 3A-3Fl 

Nucleotide sequences of the DNA herpesvirus 

associated with KS (KSKV) - 

Ficruree 4A-4B : 

PGR amplification of a representative set of KS- 
derived DNA samples using KS330- : .. primers. 
Figure 4A shows the agarose gel of the 
amplification products from 19 KS DNA samples 
(lanes 1-19) and Figure 45 shows specific 
hybridization of the PCR products to a 3: ? end- 
labelled 25 bp internal oligonucleotide (Figure 
3B) after transfer of the gel to a nitrocellulose 
filter. Negative samples in lanes 3 and 15 
respectively lacked microscopically detectable KS 
in the sample or did not amplify the constitutive 
c = 3 exon 6, suggesting that these samples were 
negative for technical reasons. An additional 8 
AIDS-KS samples were amplified and a_± were 
positive for KS3 3 0 234 . Lane 20 is a negative 
control and Lane M is a molecular weight marker. 

Figure 5 : 

Southern blot hybridization of KS33D3am and 
KS627Bam to AIDS-KS genomic DNA extracted from 
three subjects (lanes 1, 2, and 3) and digested 
with PvuII- Based on sequence information 

(Fiaure 3A) , restricted sites for ?vu II occur 
between bo 12361-12362 cf the KSKV sequence 
(Figure 3A, SEQ ID NO: 1), at bp 134 m KS3303am 
(Figure 33, SEQ ID NO: 2) and bp 414 m KSS273am 
(Figure 3C, SEQ ID NO: 3) - KS33GBam and KSS27Bam 
failed to hybridize to the same fragments in the 
diaests indicating that the two sequences 
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separated from each other by one cr more 
intervening Bam KI restriction fragments. 
Digestion with Pvu II and hybridization to 
KS3303am resulted in two distinct banding 
patterns (lanes 1 and 2 vs. lane 3) suggesting 
variation between KS samples. 



10 



Figure 6 : 

Comparison of amino acid homologies between E3V 
ORF BDLF1, HSVSA ORF 2 6 and a 913 bp reading 
frame of the Kaposi's sarcoma agent which 
includes KS330Bam. Amino acid identity is 
denoted by reverse lettering. In HSVSA, ORF 26 
encodes a minor capsid V?23 which is a late gene 



15 proauc: 



Figure 7 : 

Subculture of Ra j i cells co- cultivated with 3C3L- 
1 cells treated with TP A for 2 days. PCR shows 
that Raji cells are positive for KSHV sequences 
and indicate that the agent is a transmissible 



virus . 



3 0 
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Ficmre 8 : 

A schematic diagram of the orientation of KSHV 
cpen reading frames identified on the KS5 20,710 
bp DNA fragment. Homciogs to each open reading 
frame from a corresponding region of the 
herpesvirus saimiri (HSVSA) genome are present in 
an identical orientation, except for the region 
corresponding to the ORF 28 of HSVSA (middle 
schematic section) . The shading for each cpen 
reading frame corresponds to the approximate % 
amino acid identity for the KSHV ORF compared to 
this homolog in HSVSA. Noteworthy homologs that 
are present in this section of DNA include 
homologs to thymidine kinase (ORF21), gK 
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glycoprotein (ORF22) , major capsid ? rct ein 
(ORF25) and the VP23 protein (ORF26) whicn 
contains the original KS330Bam sequence derived 
by representational difference analysis. 

Figure 9 : 

The -2 0 0 kD antigen band appearing or. a Western 
blot cf KS patient sera against BZBL1 ivsate ;»-.' 
and Raji lysate (RA) . M is molecular weight 
marker . 

kD and 24 0 kD . 



The antigen is a doublet between ca . 210 



Figure 10: 

5 control patient sera without KS (A1N, A2N, A3N, ■ 
A4N and A5N) . B1=BCBL1 lysate, RA=Raji lysate. 
The 220 kD band is absent from the Western blots 
using patient sera without KS . 

Figure 11: 

In this figure, 0.5 ml aiiquots of the gradient 
have been fractionated (fractions 1-62) with the 
3 0% gradient fraction being at fraction No. 1 and 
the 10% gradient fraction being at fraction No. 
62. Each fraction has been dot hybridized to a 
nitrocellulose membrane and then a >: ?- labeled 
KSHV DNA fragment, KS63lBam has been hybridized 
to the membrane using standard techniques. Tne 
figure shows that the major solubilized fraction 
cf the KSHV genome bands (i.e.. is isolates! m 
fractions 42 through 4 8 cf the gradient witn a 
high concentration of the genome being present in 
fraction 44. A second band cf solubilized KSHV 
DNA occurs in fractions 26 through 32. 

F ^ Location, feature, and relative homologies cf KS5 
ooen reading frames compared to translation 
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products of herpesvirus saimiri (HSV) , equine 
herpesvirus 2 (EHV2 ) and Epstein-3arr virus 
(EBV) . 

5 Fimire 13: 

Indirect immunofluorescence end-pcir.t ana 
geometric mean titers (GMT) in AIDS-KS and AIDS 
control sera against BHL-6 and P3K3 prior tc ana 
after adsorption with P3H3 . 

10 

Figure 14 : 

Genetic map of KS5, a 20.7 kb lambda phage clone 
insert derived from a human genomic library 
prepared from an AIDS-KS lesion. Seventeen 
partial and complete open reading frames (ORFs) 
are identified with arrows denoting reading frame 
orientations. Comparable regions of the Epstein- 
Barr virus (EBV) and herpesvirus saimiri (HVS) 
genomes are shown for .comparison. Levels of 
amino acid similarity between KSHV ORFs are 
indicated by shading of EBV and HVS ORFs (black, 
over 70% similarity; dark gray, 55-70% 
similarity; light gray, 40-54% similarity; white, 
no detectable homology) . Domains of conserved 
herpesvirus sequence blocks and locations of 
restriction endonuciease sites used in subcloning 
are shown beneath the KSHV map <B, Sam HI sire; 
N, Not I site) . The small Bam HI fragment 
(black) in the VP23 gene homo log corresponds to 
30 th e KS330Bam fragment generated by 

representational difference analysis which was 
used to identify the KS5 lambda phage clone. 

Fi gures 15A -15B: 
, 5 phyloaenetic trees of KSHV based on comparison of 

aligned amino acid sequences between 
herpesviruses for the MCP gene and for a 
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concatenated nine-gene set. The comparison cf 
MC? sequences {Figure ISA) was obtained oy tne 
neighbor-joining method and is shown m unroctec 
form with branch lengths proportional to 
divergence (mean number of substitution events 
per site) between the nodes bounding eacn rrancn. 
Comparable results were obtained by maximum 
parsimony analysis. The number cf times out of 
100 bootstrap samplings the division indicated oy 
each internal branch was obtained are shown next 
zo each branch; bootstrap values below 75 are not 
shown. Figure 15B is a phyiogenetic tree of 
gammaherpesvirus sequences based on a nine-gene 
set CS1 (see text) and demonstrates that KSHV is 
most closely related to the gamma- 2 herpesvirus 
sublineage, genus Rhadinovirus . The CS1 amino 
acid sequence was used to infer a tree by the 
Protml maximum likelihood method; comparable 
results, not shown were obtained with the 
neighbor- joining and maximum parsimony methods. 
The bootstrap value for the central branch is 
marked. On the basis of the MCP analysis, the 
root must lie between E3V and the other three 
species. Abbreviations for virus species used in 
the sequence comparisons are 1) 
Alphaherpesvirinae: K5V1 and HSV2 , herpes 

simplex virus types 1 and 2; EHV1 , equine 
herpesvirus 1; PRV, pseudorabies virus; and VZV, 
varicella- zoster virus, 2) Betaherpesvinnae : 
HCMV, human cytomegalovirus; HHV6 and KHV7 , human 
herpesviruses 6 and 7 , and 3) Gammaherpesvirinae : 
KVS, herpesvirus saimiri ; — v - > 

i va\7 r-3arr virus; and 

herpesvirus 2; c.3 v , t?s 1- = 

Kaoosi's sarcoma- associated herpesvirus. 



35 
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Figures 16A-16B: 

CHEF gel electrophoresis of BCBL-1 DNA hybridized 
to KS631Bam (Figure 16A) and EBV terminal repeat 
{Figure 16B) . KS631Bam hybridizes to a band at 
5 270 kb as well as to a diffuse band at the 

origin. The EBV termini sequence hybridizes to 
a 150-160 kb band consistent with the linear form 
of the genome. Both KS6313am (dark arrow} and an 
E3V terminal sequence hybridize to high molecular 
^0 weight bands immediately below the origin 

indicating possible concatemeric or circular DNA. 
The high molecular weight KS631Bam hybridizing 
band reproduces poorly but is visible on the 
original autoradiographs . 

2 5 

Figure 17 ; 

Induction of KSHV and EBV replication in BCBL-1 
with increasing concentrations of TPA. Each 
determination was made in triplicate after 4 8 h 

20 of TP A incubation and hybridization was 

standardized to the amount of cellular DNA by 
hybridization to beta-actin. The figure shows 
the mean and range of relative increase in 
hybridizing genome for EBV and KSHV induced by 

25 TPA compared to uninduced BCBL-1. TPA at 20 

ng/ml induced an eight -fold increase in EBV 
genome (upper line) at 48 h compared to only a 
1.4 fold increase in KSHV genome (lower line). 
Despite the lower level of. KSHV induction, 

30 increased replication of KSHV genome after 

induction with TPA concentrations over 10 ng/ml 
was reproducibly detected. 

Figures 18A-18C: 

3 5 r n situ hybridization with an ORF2 6 oligomer to 

BCBL-1, Raji and RCC-1 cells. Hybridization 
occurred to nuclei of KSHV infected BCBL-1 
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{Figure ISA) , but not to uninfected Raji ce,.s 
(Figure 18B) . RCC-1, a Raji cell line derived by 
cultivation of Raji with BC3L-1 in communicating 
chambers separated by a 0.45 n filter, shows rare 
- cells with positive hybridization to the KSKV 

ORF2 6 probe (Figure ISO . 

Figures 19A-19D: 

Representative example of IFA staining cr 3----o 
with AIDS-KS patient sera and control sera rrom 
HIV-infected patients without KS . Beth AIDS-KS 
{Figure 19A) and control {Figure 193) sera show 
homogeneous staining of BKL.-S at 1:50 dilution. 
After adsorption with paraformaldehyde- fixed ?3K5 
15 zo remove cross - react ing antibodies directed 

against lymphocyte and E3V antigens, antibodies 
from AIDS-KS sera localize to BKh-6 nuclei 
(Figure 19C) . P3K3 adsorption of control sera 
eliminates immunof iuorescent staining of BKL-6 
2: (Figure 19D) . 

Figures 20A-20B: 

Longitudinal PCR examination for KSKV DNA of 
paired P3MC samples from AIDS-KS patients (A) and 
25 homosexual /bisexual AIDS patients without KS (B) . 

Time 0 is the date of KS onset for cases cr other 
AlDS-defining illness for controls. All samples 
were randomized and examined blindly. Overall, 
7 of the KS patients were KSKV positive at both 
30 examination dates (solid bars) and 5 converted 

from a negative to positive F3MC sample (forward 
striped bars) immediately prior to or after KS 
onset. Two previously positive KS patients were 
neaative after KS diagnosis (reverse striped 
35 bars) and the remaining KS patients were negative 

at both timepomts (open bars) . Two 
homosexual/bisexual control F3MC samples without 
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KS converted from negative to positive and one 
control patient reverted from PGR positive to 
negative for KSHV DNA. 

5 Figure 21: 

Sample collection characteristics for AIDS - KS 
patients, gay/bisexual AIDS patients and 
hemophilic AIDS patients. 

10 Figure 22: 

PCR analysis of KS330 233 in DNA samples from 
patients with Kaposi's sarcoma and tumor 
controls . 

15 Figure 23: 

Characteristics of the study population of 
patients with KS and without KS . 

Figure 24: 

2 0 Prevalence of antibody to KSHV p4 0 in HIV-1 

nositive patients with and without KS . 

Figure 25: 

Comparison of KS patients with and without 
antibody to KSHV p40. 



3C 



Figure 26 : 

Prevalence of antibody detectable by indirect 
immunofluorescence to KSHV antigens in chemically 
induced BCBL-1 cells in HIV-1 positive patients 
with and without KS . 

Figures 27A-27B: 

Specific recognition of KSHV polypeptides in 
chemically treated BCBL-1 ceils. Figure 27A 
shows reactivity of untreated BC3L-1 and B95-8 
cells with RM, a reference human antibody to E3V. 
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RM recognizes the EBV polypeptides E3NA1 and p21 
ir. the BCBL-1 cells. Figure 273 shows reactivity 
of untreated and chemically treated cells witn 
serum 01-03 from a patient with KS . Cells were 
created with TP A and n-butyrate for 46 hrs . For 
description of the cell lines see Materials and 
Methods. The immunoblots were prepared from 10% 
SDS Dolyacryiamide gels . 

Figures 28A-28D: 

Detection of KSHV p40 by sera from patients with 
KS. Extracts were prepared from BCBL-1 cells 
(containing KSHV and EBV) and Clone KH514-16 
cells (containing only EBV } that were unmduced 
or treated for 48 hrs with chemical inducing 
agents, n-butyrate, TPA, or a combination of the 
-wo chemicals. Immunoblots prepared from 12% SDS 
pclyacryiamide gels were reacted with a 1:200 
dilution of serum from HIV-1 positive patients. 
Figure 28A shows serum 01-06 from a patient with 
KS. Figure 263 shows serum 01-07 from a patient 
without KS. Figure 28C shows serum C4-01 from a 
pa-ient with KS . Figure 28D shows serum 01-03 
from a patient with KS . 

Figures 29A-29F: 

Detection of KSHV lytic cycle antigens by 
indirect immunofluorescence. BC3L-1 cells were 
untreated (Figures 29A, 29C, and 29E) or treated 
with n-butyrate (Figures 293, 29D, and 29F) for 
48 hrs. Indirect immunofluorescence with a 1:10 
dilution of serum from two patients with KS , 04- 
18 (Figures 29A, and 293) and 04-35 (Figures 29E, 
and 29F) and a serum, 04-37 (Figures 29C, and 
29D) , from a patient without KS . 
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DETAILED DESCRIPTION OF T TTT: INVENTION 

Definitions 

5 The following standard abbreviations are used 

throughout the specification to indicate specific 
nucleotides : 

C= cytos ine A=adenosine 
10 T= thymidine G=guanosine 

The term "nucleic acids", as used herein, refers to 
either DNA or RNA. "Nucleic acid sequence" or 
"polynucleotide sequence" refers to a single- or 
15 double -stranded polymer of deoxyribonucleot ide or 

ribonucleotide bases read from the 5' to the 3' end. 
It includes both self -replicating plasmids, infectious 
polymers of DNA or RNA and nonfunctional DNA or RNA. 

2 0 3y a nucleic acid sequence "homologous to" or 

"complementary to", it is meant a nucleic acid that 
selectively hybridizes, duplexes or binds to viral DNA 
sequences encoding proteins or portions thereof when 
the DNA sequences encoding the viral protein are 

25 present in a human genomic or cDNA library. A DNA 

sequence which is homologous to a target sequence can 
include sequences which are shorter or longer than the 
target sequence so long as they meet the functional 
test set forth. Hybridization conditions are 

30 specified along with the source of the CDNA library. 

Typically, the hybridization is done in a Southern 
blot protocol using a 0.2XSSC, 0.1% SDS, €5°C wash. 
The term "SSC" refers to a citrate-saline solution of 
2 5 0.15 M sodium chloride and 20 Mm sodium citrate. 

Solutions are often expressed as multiples or 
fractions of this concentration. For example, 6XSSC 
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refers to a solution having a sodium chloride and 
sodium citrate concentration of 6 times this amount cr 
0.S M sodium chloride and 120 mM sodium citrate. 
0.2XSSC refers to a solution 0.2 times the 
5 concentration or 0.03 M sodium chlcride and 4 

sodium citrate. 



SSC 



~he chrase "nucleic acid molecule encoding" rerers tc 
a nucleic acid molecule which directs the expression 

10 of a specific protein or peptide. The nucleic acid 

sequences include both the DNA strand sequence that is 
transcribed into RKA and the RKA sequence that is 
translated into protein. The nucleic acid molecule 
include both the full length nucleic acid sequences as 

15 well as non-full length sequences derived from the 

full length protein. It being further understood that 
the sequence includes the degenerate codcns of the 
native sequence or sequences which may be introduced 
to provide codon preference in a specific host cell. 

The phrase "expression cassette", refers tc nucleotide 
sequences which are capable of affecting expression of 
a structural gene in hosts compatible with such 
sequences. Such cassettes include at least promoters 
25 and optionally, transcription termination signals. 

Additional factors necessary or helpful in effecting 
expression may also be used as described herein. 

The term "operably linked" as used herein refers to 
3C linkage of a promoter upstream from a DNA sequence 

such that the promoter mediates 
DNA sequence . 



transcription of the 



The term "ve 



ctor", refers to viral expression systems, 
autonomous sVlf - replicating circular DNA (plasmids) , 
and includes both expression and nonexpressior. 
plasmids. Where a recombinant microorganism or ce^l 
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culture is described as hosting an "expression 
vector," this includes both extra chromosomal circular 
DNA and DNA that has been incorporated into the host 
chromosome (s) . Where a vector is being maintained by 
5 a host cell, the vector may either be stably 

retlicated by the cells during mitosis as an 
autonomous structure, or is incorporated within the 
host's genome. 

10 The term "plasmid" refers to an autonomous circular 

DNA molecule capable of replication in a cell, and 
includes both the expression and nonexpression types. 
Where a recombinant microorganism or cell culture is 
described as hosting an "expression plasmid", this 

15 includes latent viral DNA integrated into the host 

chromosome (s) . Where a plasmid is being maintained by 
a host cell, the plasmid is either being stably 
replicated by the cells during mitosis as an 
autonomous structure or is incorporated within the 

2 c host ' s genome . 

The phrase "recombinant protein" or " recombinant ly 
produced protein" refers to a peptide or protein 
produced using non-native cells that do not have an 

2 5 endogenous copy of DNA able to express the protein. 

The cells produce the protein because they have been 
aeneticaliy altered by the introduction of the 
aporocriate nucleic acid sequence. The recombinant 
orotein will not be found in association with proteins 

3C and other subcellular components normally associated 

with the cells producing the protein. 

The following terms are used to describe the sequence 
relationships between two or more nucleic acid 
25 molecules or polynucleotides: "reference sequence", 

"comparison window" , » sequence identity " , "percentage 
cf sequence identity", and "substantial identity". A 
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"reference sequence" is a defined sequence used as a 
basis for a sequence comparison; a reference sequence 
may be a subset of a larger sequence, for example, as 
a segment of a full-length cDNA or gene sequence giver, 
m a sequence listing or may comprise a complete cDKA 
cr aene sequence . 

Optimal alignment of sequences for aligning a 
comparison window may be conducted by the -oca- 
homology algorithm of Smith and Waterman (1951) Adv. 
Add!. Math. 2:482, by the homology alignment algorithm 
of" Needleman and Wunsch (1970) J. Mol . Bioi . 46:443, 
by the search for similarity method of Pearson and 
Lipman (1986) Proo . Natl. Acad. Sci . (USA) e5:2444, or 
by computerized implementations of these algorithms 
(GAP, 3ESTFIT, FAST A , and T FAST A in the .Wisconsin 
Genetics Software Package Release 7.0, Genetics 
Computer Group, 575 Science Dr., Madison, WD. 

As applied to polypeptides, the terms "substantial 
identity" or "substantial sequence identity" mean that 
-wo peptide sequences, when optimally aligned, such as 
by the programs GAP or BESTFIT using default gap which 
share at least 90 percent sequence identity, 
preferably at least 95 percent sequence identity, more 
preferably at least 99 percent sequence identity or 
more . 

"Percentage amino acid identity" or "percentage amino 
acid sequence identity" refers to a comparison of the 
amino acids of two polypeptides which, when optimally 
aligned, have approximately the designated percentage 
cf the same amino acids. For example, "55% amino acid 
identity" refers to a comparison of the amino acics or 
cwo polypeptides which when optimally aligned have 95% 
amino acid identity. Preferably, residue positions 
which are not identical differ by conservative amine 
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acid substitutions. For example, the substitution of 
amino acids having similar chemical properties such as 
charge or polarity are not likely to effect tne 
properties of a protein. Examples include glutatr.ine 
5 for asparagine or glutamic acid for aspartic acid. 

The phrase "substantially purified" or "isolated" when 
referring to a herpesvirus peptide or protein, means 
a chemical composition which is essentially free of 
10 other cellular components. It is preferably in a 
homogeneous state although it can be in either a dry 
or aqueous solution. Purity and homogeneity are 
typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis 
15 or high performance liquid chromatography. A protein 

which is the predominant species present in a 
preparation is substantially purified. Generally, a 
substantially purified or isolated protein will 
comprise more than 80% of all macromclecular species 
2 0 present in the preparation. Preferably, the protein 
is purified to represent greater than 50% of all 
macromolecular species present. More preferably the 
protein is purified to greater than 95%, and most 
preferably the protein is purified to essential 

2 5 homogeneity, wherein other macromolecular species are 

net detected by conventional techniques. 

The phrase "specifically binds to an antibody" or 
"specifically immunoreactive with",, when referring to 

3 0 a protein or peptide, refers to a binding reaction 

which is determinative of the presence of the 
herpesvirus of the invention in the presence of a 
heterogeneous population of proteins and other 
bioloaics including viruses other than the 
heroesvirus. Thus, under designated immunoassay 
conditions, the specified antibodies bind to the 
herpesvirus antigens and do not bind in a significant 
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amount to other antigens present in the sample. 
Specific binding to an antibody under such conditions 
may require an antibody that is selectee rcr its 
specificity for a particular protein. For example, 
5 antibodies raised to the human herpesvirus immunogen 

described herein can be selected to obtain antibodies 
specifically immuncreact ive ' with the herpesvirus 
oroteins and net with other proteins. Tnese 
antibodies recognize proteins homologous to the human 
10 herpesvirus protein. A variety of immunoassay formats 

may be used to select antibodies specifically 
immunoreactive with a particular protein. For 
example, solid-phase EL ISA immunoassays are routinely 
used to select monoclonal antibodies specifically 
immuncreactive with a protein. See Harlow and Lane 
[32] for a description of immunoassay formats and 
conditions that can be used to determine specific 
immunoreactivity . 

"Biological sample" as used herein refers to any 
sample obtained from a living organism or from an 
organism that has died. Examples of biological 
samples include body fluids and tissue specimens. 

z . k^sis's Sarcoma <KS) - Associated H^pesvirus. 

This invention provides an isolated DNA molecule which 
is at least 30 nucleotides in length and which 
uniquely defines a herpesvirus associated with 
3 0 Kaposi's sarcoma. 

In one embodiment the isolated DNA molecule comprises 
at least a portion of the nucleic acid sequence as 
shown in Figure 3A {SEQ ID NO: 1). In another 
35 embodiment the isolated DNA molecule is a 330 base 

pair (bp) sequence. In another embodiment the 
isolated DNA molecule is a 12-50 bp sequence. In 
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another embodiment the isolated DNA molecule is a 30- 
37 bp sequence. 

In another embodiment the isolated DNA molecule is 
genomic DNA. In another embodiment the isolated DNA 
molecule is cDNA . In another embodiment a RNA is 
derived form the isolated nucleic acid molecule cr is 
capable of hybridizing with the isolated DNA molecule. 
As used herein "genomic" means both coding and non- 
codina reaions of the isolated nucleic acid molecule. 



Further, the DNA molecule above may be associated with 
lymphoprolif erative diseases including, but not 
limited to: Hodgkin's disease, non-Kodgkin ' s lymphoma, 

15 lymphatic leukemia, lymphosarcoma, splenomegaly, 

reticular cell sarcoma, Sezary's syndrome, mycosis 
fungoides, central nervous system lymphoma, AIDS 
related central nervous system lymphoma, post- 
transplant lymphoprolif erative disorders, and 

20 Burkitt's lymphoma. A lymphoprolif erative disorder is 

characterized as being the uncontrolled clonal or 
polyclonal expansion of lymphocytes involving lymph 
nodes, lymphoid tissue and other organs. 

25 This invention provides an isolated nucleic acid 

molecule encoding an ORF20 (SEQ ID NOs : 22 and 23), 
CRF2 1 ( SEQ ID NOs : 14 and 15), ORF22 { SEQ ID NOs : 16 and 
17), ORF23 ( SEQ ID NOs:18 and 19), ORF24 (SEQ ID NOs: 
20 and 21), ORF25 {SEQ ID NOs : 2 and 3), ORF26 (SEQ ID 

30 NOs:24 and 25), ORF27 (SEQ ID NOs:26 and 27), ORF28 

{ SEQ ID NOs:28 and 29), ORF29A (SEQ ID NOs:30 and 31), 
0RF2S3 ( SEQ ID NOs : 4 and 5), ORF3 0 ( SEQ ID NOs : 6 and 
7), ORF31 (SEQ ID NOs : 8 and 9), ORF32 (SEQ ID NOs:32 
and 33), ORF3 3 (SEQ ID NOs : 10 and 11), ORF34 (SEQ ID 

35 NOs: 34 and 35), or ORF35 (SEQ ID NOs : 12 AND 13). 
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This invention provides an isolated polypeptide 
encoded by ORF20 (SEQ ID NOs : 22 and 23), CRT 2 1 (SEQ 
ID NOs : 14 and 15), ORF22 ( SEQ ID NOs : 16 and 17), ORF23 
(SEQ ID NOs :1B and 19), OR? 2 4 (SEQ ID NOs : 20 and 21) , 
ORF25 (SEQ ID NOs : 2 and 3), ORF26 (SEQ ID NOs: 24 and 
25;, ORF27 { SEQ ID NOs : 26 and 27), ORF25 (SEC IE 
NOs: 26 and 23), ORF29A (SEQ ID NOs : 3 0 and 31), ORF25S 
;SEQ ID NOs : 4 and 5), ORF30 (SEQ ID NOs : 6 and 1), 
ORF21 (SEQ ID NOs : 8 and 9), ORF32 ( SEQ ID NOs: 32 and 
33) , ORF33 (SEQ ID NOs : 10 and 11) , ORF34 (SEQ ID NOs : 
34 and 35), or ORF35 (SEQ ID NOs : 12 AND 13) . 

For Example, TK is encoded by ORE 21; glycoprotein H 
tgH) by ORF 22; major capsid protein (MCF) by ORF 25; 
virion polypeptide (VP23) by ORF 26; and minor capsid 
crotein by ORF 27. 

This invention provides for a replicabie vector 
comprising the isolated DNA molecule of the DNA virus. 
The vector includes, but is not limited to: a plasmid, 
cosmic, X phage or yeast artificial chromosome (YAC) 
which contains at least a portion of the isolated 
nucleic acid molecule. 

As an example to obtain these vectors, insert and 
vector DNA can both be exposed to a restriction enzyme 
to create complementary ends on both molecules which 
base pair with each other and are then ligated 
together with DNA ligase. Alternatively, linkers can 
be ligated to the insert DNA which correspond to a 
restriction site in the vector DNA, which is then 
digested with the restriction enzyme which cuts at 
that site. Other means are also available and known 
to an ordinary skilled practitioner. 

Recrulatcry elements required for expression include 
promoter cr enhancer sequences to bind RICA polymerase 
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and transcription initiation sequences for ribosome 
binding. For example, a bacterial expression vector 
includes a promoter such as the lac promoter and for 
transcription initiation the Shine -Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector includes a heterologous or 
homologous promoter for RNA polymerase 11, a 
downstream poiyadenylat ion signal, the start cooon 
AUG, and a termination codon for detachment of the 
ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 
well-known in the art, for example the methods 
described above for constructing vectors in general. 

15 This invention provides a host cell containing the 

above vector. The host cell may contain the isolated 
DNA molecule artificially introduced into the host 
ceil. The host cell may be a eukaryotic or bacterial 
ceil (such as E.coli ) , yeast cells, fungal cells, 

20 insect cells and animal cells. Suitable animal cells 

include, but are not limited to Vero cells, HeLa 
cells, Cos cells, CVl cells and various primary 
mammalian cells . 

25 This invention provides an isolated herpesvirus 

associated with Kaposi's sarcoma. In one embodiment 
the herpesvirus comprises at least a portion of a 
nucleotide sequence as shown in Figures 3A (SEQ ID NO: 
1) . 



a 



In one embodiment the herpesvirus may be a DNA virus 
In another embodiment the herpesvirus may be 
Herpesviridae. In another embodiment the herpesvirus 
may* be a gammaherpesvirinae . The classification of 
the herpesvirus may vary based on the phenotypic or 
molecular characteristics which are known to those 
skilled in the art. 
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This invention provides an isolated DNA virus wherein 
the viral DNA is about 270 kb in size, wherein the 
viral DNA encodes a thymidine kinase, and wherein the 
viral DNA is capable of selectively hybridizing to a 
nucleic acid probe selected from the group consisting 
cf SEQ ID NOs: 36-40 . 

Th£ KS- associated human herpesvirus of the invention 
is associated with KS and is involved in the etiology 
cf the disease. The taxonomic classification cz tne 
virus has not yet been made and will be based on 
phenotypic or molecular characteristics known to those 
of skill in the art. However, the novel KS - associated 
virus is a DNA virus that appears to be related to the 
Kerpesviridae family and the gammaherpesvirinae 
subfamily, on the basis of nucleic acid homology. 

A . QP ^iPnre identity o f t-h- viral nNA and its 
proteins . 

The human herpesvirus of the invention is not limited 
re" 'the virus having the specific DNA sequences 
described herein. The KS-associated human 
herpesvirus DNA shows substantial sequence identity, 
as defined above, to the viral DNA sequences described 
herein. DNA from the human herpesvirus typically 
selectively hybridizes to one or more of the following 
rhree nucleic acid probes: 

Probe I (SEQ ID NO:38) 

AGCCGAAAGG ATTCCACCAT TGTGCTCGAA TCCAAGGGAT TTGACCCCGT 
GTTCCCCATG GTCGTGCCGC AGCAACTGGG GCACGCTATT CTGCAGCAGC 
T3TTGGTGTA CCACATCTAC TCCAAAATAT CGGCCGGGGC CCCGGATGAT 
gtIIatatgg CGGAACTTGA TCTATATACC ACCAATGTGT CATTTATGGG 
GCGCACATAT CGTCTGGACG TAGACAACAC GGA 
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Probe 2 (SEQ ID NO:39) : 

GAAATTACCC ACGAGATCGC TTCCCTGCAC ACCGCACTTG GCTACT CATC 
AGTCATCGCC CCGGCCCACG TGGCCGCCAT AACTACAGAC ATGGGAGTAC 
ATTGTCAGGA CCTCTTTATG ATTTTCCCAG GGGACGCGTA TCAGGACCGC 
CAGCTGCATG ACT AT AT CAA AATGAAAG CG GGCGTGCAAA CCGGCTCACC 
G G G AAA C AG A ATGGATCACG TGGGAT AC A C TGCTGGGGTT CCTCGZTGZG 
ACAACCTGCC CGGTTTGAGT CATGGTCAGC TGGCAACCTG CGAGATAATT 
CCCACGCCGG TCACATCTGA CGTTGCCT 
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Probe 3 (SEQ ID NO: 40) : 

AACACGTCAT GTGCAGGAGT GACATTGTGC CGCGGAGAAA CTCAGACCGC 
ATCCCGTAAC CACACTGAGT GGGAAAATCT GCTGGCTATG TTTTCTGTGA 
TTATCTATGC CTTAGATCAC AACTGTCACC CG 

Hybridizatiion of a viral DNA to the nucleic acid 
probes listed above is determined by using standard 
nucleic acid hybridization techniques as described 
herein. In particular, PCR amplification of a viral 
aenome can be carried out using the following three 
sets of PCR primers: 



— j 



AG C C G AAAG G ATT C C A C CAT ; 

TCCGTGTTGTCTACGTCCAG (SEQ ID NO : 41) 



2 ) G AAATT AC CCACG AG AT CGC ; 

AGGCAACGTCAGATGTGA { SEQ ID NO: 42; 

30 3 ) AACACGTCATGTGCAGGAGTGAC ; 

CGGGTGACAGTTGTGATCTAAGG (SEQ ID NO : 4 3 ) 



35 In PCR techniques, oligonucleotide primers, as listed 

above, complementary to the two 3' borders of the DNA 
recion to be amplified are synthesized. The 
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polymerase chain reaction is then carried out using 
the two primers. See PCR Protocols: A Guide to 
Methods and Applications [74] . Following PCR 

amplification, the ?CR-amplif ied regions of a viral 
5 DNA can be tested for their ability to hybridize tc 

the three specific nucleic acid probes listed above. 
Alternatively, hybridization -cf a vira. DNA to tne 
above nucleic acid probes can be performed by a 
Southern blot procedure without viral DKA 
amplification and under stringent hybridization 
conditions as described herein. 

Oligonucleotides for use as probes or PCR primers are 
chemically synthesized according tc the solid phase 
phosphoramidite triester method first described by 
Beaucage and Carruthers [19] using an automated 
synthesizer, as described in Needham- VanDevanter [6 9] . 
Purification of oligonucleotides is by either native 
a cry! amide gel electrophoresis or by anion- exchange 
KPLC as described in Pearson, J.D. and Regnier, F.E. 
;75Aj . The sequence of the synthetic oligonucleotide 
can be verified using the chemical degradation method 
of Kaxam, A.M. and Gilbert, W. [63] . 

B. ' -relation and pro rogation cf KS-inducinq 

strains o J the Hum an «orpesvirus 

Using conventional methods, the human herpesvirus can 
be propagated in vitro . For example, standard 
techniques for growing herpes viruses are described in 
Abiashi, D.V. [1] . Briefly, ?KA stimulated cord blood 
mononuclear cells, macrophage, neuronal, or glial cell 
lines are cocultivated with cerebrospinal fluid, 
plasma, peripheral blood leukocytes, or tissue 
extracts containing viral infected ceils or purified 
virus. The recipient cells are treated with 5 jig/ml 
oolvbrene for 2 hours at 37° C prior to infection. 



2 0 



"a " 



WO 96/06159 



FCI7US95/10194 



26 

Infected cells are observed by demonstrating 
morphological changes, as well as being positive for 
antigens from the human herpesvirus by using 
monoclonal antibodies immunoreact ive with the human 
5 heroes virus in an immunofluorescence assay. 

For virus isolation, the virus is either harvested 
directly from the culture fluid by direct 
centrifugation, or the infected cells are harvested, 
10 homogenized or lysed and the virus is separated from 

cellular debris and purified by standard methods of 
isopycnic sucrose density gradient centrifugation. 

One skilled in the art may isolate and propagate the 
DNA herpesvirus associated with Kaposi's sarcoma 
(KSHV) employing the following protocol. Long-term 
establishment of a B lymphoid cell line infected with 
the KSHV from body- cavity based lymphomas (RCC-1 or 
3KL-6) is prepared extracting DNA from the Lymphoma 
tissue using standard techniques [27, 49, 66] . 

The KS associated herpesvirus may be isolated from the 
cell DNA in the following manner. An infected cell 
line (BHL-6 RCC-1) , which can be lysed using standard 
methods such as hyposomatic shocking and Dounce 
homogenization, is first pelleted at 2000xg for 10 
minutes, the supernatant is removed and centrifuged 
aaain at 10,000xg for 15 minutes to remove nuclei and 
organelles. The supernatant is filtered through a 
0.45^ filter and centrifuged again at 100,000xc for 1 
hour to pellet the virus. The virus can then be 
washed and centrifuged again at 100 , OOOxc for 1 hour. 

The DNA is tested for the presence of the KSKV by 
Southern blotting and PCR using the specific probes as 
described hereinafter. Fresh lymphoma tissue 

containing viable infected cells is simultaneously 
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filtered to form a single cell suspension by standard 
techniques [49, €6]. The ceils are separated by 
standard Ficcil - Plague centrif ugation and lymphocyte 
layer is removed. The lymphocytes are then placec at 
5 >Ixl 0 C cells/ml into standard lymphocyte tissue culture 

medium, such as RM? 1640 supplemented with 10% fetal 
calf serum. Immortalized lymphocytes containing tne 
KSKV virus are indefinitely grown m the culture mec-a 
while nonimmortilized ceils die during course of 
IG prolonged cultivation. 

Further, the virus may be propagated in a new cell 
line by removing media supernatant containing the 
virus from a continuously infected ceil line at a- 
concentration of >lxl0* ceils/ml. The media is 
centrifuged at 2000xg for 10 minutes and filtered 
through a 0.45^ filter to remove cells. The media is 
applied in a 1 : 1 volume with cells growing at >lxl0 6 
ceils/mi for 48 hours. The cells are washed and 
pelleted and placed in fresh culture medium, and 
tested after 14 days of growth. 

RCC-1 and RCC-1 2FE were deposited on October 19, 1994 
under ATCC Accession No. CRL 11734 and CRL 11735, 
respectively, pursuant to the Budapest Treaty on the 
International Deposit of Microorganisms for the 
Purposes of Patent Procedure with the Patent Culture 
Depository of the American Type Culture Collection, 
123C1 Parklawn Drive, Rockville, Maryland 20S52 U.S.A. 

BKL-6 was deposited on November 16, 1994 under ATCC 
Accession No. CRL 11762 pursuant to the Budapest 
Treaty on the International Deposit of Microorganisms 
for the Purposes of Patent Procedure with the Patent 
Culture Depository of the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, Maryland 
20852 U.S.A. 
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C. Immunological Ide nr^ ry of the Virus 

The KS-associated human herpesvirus can also oe 
described immunologically. KS-associated human 

herpesviruses are selectively immunoreact ive to 
antisera generated against a defined immunogen su:r. as 
the viral major capsid protein depicted in See. ID Nc . 
12, herein. Immunoreact ivity is determined m an 
immunoassay using a polyclonal antiserum which was 
raised to the protein which is encoded by the amino 
acid sequence or nucleic. acid sequence of SEQ ID NOs : 
18-20. This antiserum is selected to have low 
crossreactivity against other herpes viruses and any- 
such crossreactivity is removed by immunoabsorbtion 
prior to use in the immunoassay. 

In order to produce antisera for use in an 
immunoassay, the protein which is encoded by the amino 
acid sequence or nucleic acid of SEQ ID NOs: 18-20 is 
20 isolated as described herein. For example, 

recombinant protein can be produced in a mammalian 
cell line. An inbred strain of mice such as balb/c is 
immunized with the protein which is encoded by the 
amine acid sequence or nucleic acid of SEQ ID NOs: 2- 
25 37 using a standard adjuvant, such as Freund's 

adiuvant, and a standard mouse immunization protocol 
isee [32] , supra) . Alternatively, a synthetic peptide 
derived from the sequences disclosed herein and 
conjugated to a carrier protein, can be used an 
3 0 immunogen. Polyclonal sera are collected and titered 

against the immunogen protein in an immunoassay, for 
example, a solid phase immunoassay with the immunogen 
immobilized on a solid support. Polyclonal antisera 
with a titer of 10* or greater are selected and tested 
35 fcr their cross reactivity against other viruses of 

the gammaherpesvirinae subfamily, particularly human 
heroes virus types 1-7, by using a standard 
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immunoassay as described in [32] , supra. These o:ner 
gammaherpesvirinae virus can be isolated by standard 
techniques for isolation herpes viruses as describee 
^isre in 

The ability of the above viruses to compete with tne 
binding of the antisera to the immunogen protein is 
determined. The percent crossreactivity for ctner 
viruses is calculated, using standard calculations. 
Those antisera with less than 10% crossreactivity with 
each of the other viruses listed above is selected and 
pooled. The cross-reacting antibodies are then 
removed from the pooled antisera by immunoabsorption 
with the above- listed viruses. 



The immunoabsorbed and pooled antisera are then used 
in a competitive binding immunoassay procedure as 
described above to compare an unknown virus 
preparation to the specific KS herpesvirus preparation 
2- described herein and containing the nucleic acid 

sequence described in SEQ ID NOs : 2-37. In order to 
make this comparison, the immunogen protein which is 
encoded by the amino acid sequence or nucleic acid of 
SEQ ID NOs: 2-37 is the labeled antigen and the virus 
25 preparations are each assayed at a wide range of 

concentrations. The amount of each virus preparation 
required to inhibit 50% of the binding of the antisera 
to the labeled immunogen protein is determined. Those 
viruses that specifically bind . to an antibody 
generated to an immunogen consisting of the protein of 
SEQ ID NOs : 2-37 are those virus where the amount of 
virus needed to inhibit 50% of the binding to the 
protein does not exceed an established amount. This 
amount is no more than 10 times the amount of the 
virus that is needed for 50% inhibition for the KS- 
associated herpesvirus containing the DNA sequence of 
SEQ ID NO: 1. Thus, the KS - associated herpesviruses 
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f the invention can be defined by immunologics! 
comparison to the specific strain of the KS-associated 
herpesvirus for which nucleic acid sequences are 
provided herein. 

This invention provides, a nucleic acid molecule cf at 
least 14 nucleotides capable of specifically 
hybridizing with the isolated DNA molecule. In one 
embodiment, the molecule is DNA. In another 

embodiment, the molecule is RKA. In another 

embodiment the nucleic acid molecule may be 14-20 
nucleotides in length. In another embodiment the 
nucleic acid molecule may be 16 nucleotides in length. 

This invention provides, a nucleic acid molecule of at 
least 14 nucleotides capable of specifically 
hybridizing with a nucleic acid molecule which is 
complementary to the isolated DNA molecule. In one 
embodiment, the molecule is DNA. In another 

embodiment, the molecule is RNA. 

The nucleic acid molecule of at least 14 nucleotides 
may hybridize with moderate stringency to at least a 
portion of a nucleic acid molecule with a sequence 
shown in Figures 3A-3F (SEQ ID NOs : 1, 10-17, and 36- 
40) . 



High stringent hybridization conditions are selected 
at about 5° C lower than the thermal melting point 
30 (Tm) for the specific sequence at a defined ionic 

strength and pH . The Tm is the temperature (under 
defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched 
probe. Typically, stringent conditions will be those 
in which the salt concentration is at least about 0.02 
molar at pH 7 and the temperature is at least about 
60°C. As other factors may significantly affect the 
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■stringency of hybridization, including, among others, 
base composition and size of the complementary 
strands, the presence of organic solvents, ie . salt 
or fcrmamide concentration, and the extent ct oase 
5 mismatching, the combination of parameters is more 

important than the absolute measure cf any one. Fcr 
Example high stringency may be attained fcr example by 
cverniaht hybridization at about 68°C in a €x SSC 
solution, washing at room temperature with 6x SSC 
10 solution, followed by washing at about 6e°C ir. a €x 

SSC in a 0 . 6x SSX solution. 
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Hybridization with moderate stringency m Q y be attame 
for example by: 1) filter pre - hybridizing and 
hybridizing with a solution of 3x sodium chloride, 
sodium citrate (SSC) , 50% formamide, Tris buffer 

at Ph 7.5, 5x Denhardt's solution; 2.) pre- 

hybridization at 37°C for 4 hours; 3) hybridization 
at 37°C with amount of labelled probe equal to 
3,000,000 cpm total for 16 hours; 4) wash in 2x SSC 
and 0.1% SDS solution; 5) wash 4x for 1 minute each 
at room temperature at 4x at 60°C for 30 minutes each; 
and 6) dry and expose to film. 

The phrase- "selectively hybridizing to' 1 refers to a 
nucleic acid probe that hybridizes, duplexes or binds 
only to a particular target DNA or RNA sequence when 
-he target sequences are present in a preparation of 
total cellular DNA or RNA. 3y selectively hybridizing 
it is meant that a probe binds to a given target in a 
manner that is detectable in a different manner from 
non-target sequence under high stringency conditions 
cf hybridization, in a different "Complementary" or 
"target" nucleic acid sequences refer to those nucleic 
acid' sequences which selectively hybridize to a 
nucleic acid probe. Proper annealing conditions 
depend, for example, upon a probe's length, base 
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composition, and the number of mismatches and their 
position on the probe, and must often be determined 
empirically. For discussions of nucleic acid probe 
design and annealing conditions, see, for example, 
5 Sambrook et al . , [81] or Ausubel, ?., en al., [6]. 

It will be readily understood by those skilled in the 
art and it is intended here, that when reference is 
made to particular sequence listings, such reference 
10 includes sequences which substantially correspond to 

its complementary sequence and those described 
including allowances for minor sequencing errors, 
sinale base changes, deletions, substitutions and the 
like, such that any such sequence variation- 
IB corresponds to the nucleic acid sequence of the 
pathogenic organism or disease marker to which the 
relevant sequence listing relates. 

Nucleic acid probe technology is well known to those 
2G skilled in the art who readily appreciate that such 

probes may vary greatly in length and may be labeled 
with a detectable label, such as a radioisotope or 
fluorescent dye, to facilitate detection of the probe. 
DNA probe molecules may be produced by insertion of a 

2 5 DNA molecule having the full-length or a fragment of 

the isolated nucleic acid molecule of the DNA virus 
into suitable vectors, such as plasmids or 
bacteriophages, followed by transforming into suitable 
bacterial host cells, replication in the transformed 

3 0 bacterial host cells and harvesting of the DNA probes, 

using methods well known in the art. Alternatively, 
probes may be generated chemically from DNA 
synthesizers . 

35 DNA virus nucleic acid rearrangements/mutations may be 

detected by Southern blotting, single stranded 
conformational polymorphism gel electrophoresis 
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(SSCP) , ?CR or other DNA based techniques, cr for RNA 
species by Northern blotting, PCR cr other RNA-based 
techniques . 

5 RNA probes may be generated by inserting the full 

lenath or a fragment of the isolated nucleic acid 
molecule of the DNA virus downstream of a 
bacteriophage promoter such as T3 , 77 or S?6 . Large 
amounts of RNA probe may be produced by incubating the 
IC labeled nucleotides with a linearized isolated nucleic 

acid molecule of the DNA virus cr its fragment where 
it contains an upstream promoter in the presence of 
the appropriate RNA polymerase. 

15 As defined herein nucleic acid probes may be DNA or 

RNA fragments. DNA fragments can be prepared, for 
example, by digesting plasmid DNA, or by use cf PCR, 
or synthesized by either the phosphoramidite method 
described by Beaucage and Carruthers, [IS] , or by the 

20 triester method according to Matteucci, er ai . , [62], 

both incorporated herein by reference. A double 
stranded fragment may then be obtained, if desired, by 
annealing the chemically synthesized single strands 
together under appropriate conditions or by 

2 5 synthesizing the complementary strand using DNA 

polymerase with an appropriate primer sequence. Where 
a specific sequence for a nucleic acid probe is given, 
it is understood that the complementary strand is also 
identified and included. The complementary strand 

3 0 will work equally well in situations where the target 

is a double -stranded nucleic acid. It is also 
understood that when a specific sequence is identified 
for use a nucleic probe, a subsequence cf the listed 
sequence which is 25 basepairs or more in length is 
3 5 also encompassed for use as a probe. 
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The DNA molecules of the subject invention also 
include DNA molecules coding for polypeptide analogs, 
fragments or derivatives of antigenic polypeptides 
which differ from naturally-occurring forms in terms 
of the identity or location of one or more amino acid 
residues (deletion analogs containing less than all of 
the residues specified for the protein, substitution 
analogs wherein one or more residues specified are 
replaced by other residues and addition analogs where 
in one or more amino acid residues is added to a 
terminal or medial portion of the polypeptides) and 
which share some or all properties of naturally- 
occurring forms. These molecules include: the 
incorporation of codons "preferred" for expression by 
selected non-mammalian hosts; the provision of sites 
for cleavage by restriction endonuclease enzymes; and 
the provision of additional initial, terminal or 
intermediate DNA sequences that facilitate 
construction of readily expressed vectors. 



This invention provides for an isolated DNA molecule 
which encodes at least a portion of a Kaposi's sarcoma 
associated herpesvirus: virion polypeptide 23, major 
capsid protein, capsid proteins, thymidine kinase, or 
2= tegument protein. 

This invention also provides a method of producing a 
polypeptide encoded by isolated DNA molecule, which 
comprises growing the above host vector system under 
30 suitable conditions permitting production of tne 

polypeptide and recovering the polypeptide S o 
produced . 

This invention provides an isolated peptide encoded by 
35 the isolated DNA molecule associated with Kaposi's 

sarcoma. In one embodiment the peptide may be a 
polypeptide. Further, this invention provides a host 
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cell which expresses the polypeptide of isolated DNA 
molecule. 

In one embodiment the isolated peptide or polypeptide 
5 is encoded by at least a portion of an isolated DNA 

molecule. In another embodiment' the isolated peptide 
cr Dclypeptide is encoded by at least a portion c: a 
nucleic acid molecule with a sequence as set forth m 
(SEQ ID NOS: 2-37) . 

r, 

Further, the isolated peptide or polypeptide encoded 
by the isolated DNA molecule may be linked to a second 
nucleic acid molecule to form a fusion protein by 
expression in a suitable host cell. In one embodiment 
.5 the second nucleic acid molecule encodes beta- 

aalactosidase. Other nucleic acid molecules which are 
used to form a fusion protein are known to those 
skilled in the art. 

2 0 This invention provides an antibody which specifically 

binds to the peptide or polypeptide encoded by the 
isolated DNA molecule. In one embodiment the antibody 
:s a monoclonal antibody. In another embodiment the 
antibody is a polyclonal antibody. 

2 5 

The antibody cr DNA molecule may be labelled with a 
detectable marker including, but not limited to: a 
radioactive label, or a colorimetric , a luminescent, 
cr a fluorescent marker, or gold. Radioactive labels 
30 include, but are not limited to: 5 H , i4 C, ~P, JJ ?; "S, 

*-cr f 5, Co, 55 Co, 55 Fe, 50 Y, 125 I, "'1 . and 16 ^Re . 
Fluorescent markers include but are net limited to: 
fluorescein, rhodamine and auramine . Colorimetric 
markers include, but are not limited to: biotin, ana 
2 5 digoxigenin. Methods of producing the polyclonal or 

monoclonal antibody are known to those of ordinary 
skill in the art. 
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Further, the antibody or nucleic acid molecule complex 
may be detected by a second antibody which may be 
linked to an enzyme, such as alkaline phosphatase =r 
horseradish peroxidase. Other enzymes which may be 
5 employed are well known to one of ordinary skill ir. 

the art - 

This invention provides a method to select speziz i c 
regions on the polypeptide encoded by the isolated DNA 
molecule of the DNA virus to generate antibodies. 
The protein sequence may be determined from the cDNA 
sequence. Amino acid sequences may be analyzed by 
methods well known to those skilled in the art to 
determine whether they produce hydrophobic or 
hydrophilic regions in the proteins which they build. 
In the case of cell membrane proteins, hydrophobic 
regions are well known to form the part of the protein 
that is inserted into the lipid bilayer of the cell 
membrane, while hydrophilic regions are located on the 
cell surface, in an aqueous environment. Usually, the 
hydrophilic regions will be more immunogenic than the 
hydrophobic regions. Therefore the hydrophilic amino 
acid sequences may be selected and used to generate 
antibodies specific to polypeptide encoded by the 
25 isolated nucleic acid molecule encoding the DNA virus. 

The selected peptides may be prepared using 
commercially available machines. As an alternative, 
DNA, such as a cDNA or a fragment thereof, may be 
cloned and expressed and the resulting polypeptide 
3 0 recovered and used as an immunogen . 

Polyclonal antibodies against these peptides may be 
produced by immunizing animals using the selected 
peptides. Monoclonal antibodies are prepared using 
.5 hybridoma technology by fusing antibody producing 3 

ce us from immunized animals with myeloma ceils and 
selecting the resulting hybridoma ceil line producing 
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the desired antibody. Alternatively, monoclonal 
antibodies may be produced by in vitro technique- 
known to a person of ordinary skill in the art. These 
antibodies are useful to detect the expression of 
5 polypeptide encoded by the isolated DNA molecule of 

the DNA virus in living animals, in humans, or m 
biological tissues or fluids isolated from animals or 
humans . 



10 II. I mmunoa ssavs 

The antibodies raised against the viral strain or 
peptides may be detectably labelled, utilizing 
conventional labelling techniques- well-known to the 
15 art. Thus, the antibodies may be radiolabelied using, 

for example, radioactive isotopes such as 3 K, ~ 25 1 , 1 'I, 
and 3S S. 

The antibodies may also be labelled using fluorescent 
2C labels, enzyme labels, free radical labels, or 

bacteriophage labels, using techniques known in the 
art. Typical fluorescent labels include fluorescein 
isothiocyanate, rhodamine , phycoerythrin , phycooyanin, 
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alophycocyanin , 



and Texas Red. 



Since specific enzymes may be coupled to other 
molecules by covalent links, the possibility also 
exists that they might be used as labels for the 
production of tracer materials. .Suitable enzymes 
include alkaline phosphatase, beta-galactosidase , 
glucose -6 -phosphate dehydrogenase , maleate 
dehydrogenase, and peroxidase. Two principal types of 
enzyme immunoassay are the enzyme- linked immunosorbent 
assay (ELISA) , and the homogeneous enzyme immunoassay, 
also known as enzyme -multiplied immunoassay { EMIT , 
Syva Corporation, Palo Alto, CA) . In the ELISA 
system, separation may be achieved, for example, by 
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the use of antibodies coupled to a solid phase. The 
EMIT system depends on deactivation of the enzyme in 
the tracer-antibody complex; the activity can thus be 
measured without the need for a separation step. 

Additionally, chemi luminescent compounds may be usee 
as labels. Typical chemi luminescent compounds induce 
luminol, isoluminol, aromatic acridinium esters, 
imidazoles, acridinium salts, and oxalate esters. 
-C Similarly, bioluminescent compounds may be utilized 

for labelling, the bioluminescent compounds including 
luciferin, luciferase, and aeguorin. 

Once labeled, the antibody may be employed to identify 
and quantify immunologic counterparts (antibody or 
antigenic polypeptide) utilizing techniques well-known 
to the art . 

A description of a radioimmunoassay (RIA) may be found 
in Laboratory Techniques in Biochemistry and Molecular 
Biology [52] , with particular reference to the chapter 
entitled "An Introduction to Radioimmune Assay and 
Related Techniques" by Chard, T., incorporated by 
reference herein. 

A description of general immunometric assays of 
various types can be found in the following U.S. Pat. 
Nos. 4,376,110 (David et al . ) or 4,098,876 (Piasio) . 

A. Assavs for vira l antiaens 

In addition to the detection of the causal agent using 
nucleic acid hybridization technology, one can use 
immunoassays to detect for the virus, specific 
peptides, or for antibodies to the virus or peptides. 
A aeneral overview of the applicable technology is in 
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Harlow and Lane [32] , incorporated by reference 
herein . 

In one embodiment:, antibodies to the human herpesvirus 
5 can be used to detect the agent in the sample. In 

brief, to produce antibodies to the agent or peptides, 
the sequence being targeted is expressed in 
transfected ceils, preferably bacterial cells, and 
purified. The product is injected into a mammal 
10 capable of producing antibodies. Either monoclonal or 

polyclonal antibodies (as well as any recombinant 
antibodies) specific for the gene product can be used 
in various immunoassays. Such assays include 

competitive immunoassays, radioimmunoassays, Western 
blots, ELISA, indirect immunof luorescent assays and 
the like. For competitive immunoassays, see Harlow 
and Lane [32] at pages 567-573 and 584-589. 



Monoclonal antibodies or recombinant antibodies may be 
obtained by various techniques familiar to those 
skilled in the art. Briefly, spleen cells or other 
lymphocytes from an animal immunized with a desired 
antigen are immortalized, commonly by fusion with a 
myeloma cell (see, Kohler and Miistein [50], 
25 incorporated herein by reference) . Alternative 

methods of immortalization include transformation with 
Epstein Barr Virus, oncogenes, or retroviruses, or 
other methods well known in the art. Colonies arising 
from single immortalized ceils are screened for 
production of antibodies of the desired specificity 
and affinity for the antigen, and yield cf the 
monoclonal antibodies produced by such ceils may be 
enhanced by various techniques, including injection 
into the peritoneal cavity of a vertebrate host. New 
technioues using recombinant phage antibody expression 
systems can also be used to generate monoclonal 
antibodies. See for example: McCaf ferry, J ez al . 
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[64]; Hoogenboom, K . R. etal. [39]; and Marks , J.D. e = 
al. [60] . 

Such pep-ides may be produced by expressing the 
5 specific sequence in a recombinantly engineered ceil 

such as bacteria, yeas-, filamentous fungal, insect 
(especially employing bacuioviral vectors) , and 
mammalian cells. Those of skill in the art are 
knowledgeable in the numerous expression systems 
10 available for expression of herpes virus protein. 

Briefly, the expression of natural or synthetic 
nucleic acids encoding viral protein will typically be 
achieved by operably linking the desired sequence or 

15 portion thereof to a promoter (which is either 

constitutive or inducible) , and incorporated into an 
expression vector. The vectors are suitable for 
replication or integration in either prokaryotes or 
eukaryotes. Typical cloning vectors contain 

20 antibiotic resistance markers, genes for selection of 

transf crmants, inducible or regulatable promoter 
regions, and translation terminators that are useful 
for the expression of viral genes. 

25 Methods for the expression of cloned genes in bacteria 

are also well known. In general, to obtain high level 
expression of a cloned gene in a prokaryotic system, 
it is advisable to construct expression vectors 
containing a strong promoter -to direct mRNA 

30 transcription. The inclusion of selection markers in 

DNA vectors transformed in E . coli is also useful. 
Examples of such markers include genes specifying 
resistance to antibiotics. See [81] supra , for 
details concerning selection markers and promoters for 

35 use in E . coli. Suitable eukaryote hosts may include 

plant cells, insect cells, mammalian cells, yeast, and 
filamentous fungi. 
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Methods for characterizing naturally processed 
peptides bound to" MKC imajcr histocompatibility 
complex) I molecules have been developed. See, Falk 
ez al. [24], and PCX publication No. WO S2/21C33 
published November 26, 1952, both of which are 
incorporated by reference herein. Typically, these 
methods involve isolation of MKC class I molecules by 
irtmuncprecipitation or affinity chromatography rrom an 
accrocriate cell or ceil line. Other methods involve 
direct amino acid sequencing of the more abundant 
peptides in various KFLC fractions by known automatic 
sequencing of peptides eluted from Class I molecules 
of the B cell type (Jardetzkey, et al - [45], 
incorporated by reference herein, and of the human MKC 
class 1 molecule, HLA-A2 . 1 type by mass spectrometry 
(Kunt, et al. [40] , incorporated by reference herein) . 
See also, Rotzschke and Falk [79], incorporated by 
reference herein for a general review of the 
characterization of naturally processed peptides in 
MKC class I. Further, Marloes, et al . [61], 

incorporated by reference herein, describe how class 
I binding motifs can be applied to the identification 
c-f potential viral immunogenic peptides in vitro . 

The peptides described herein produced by recombinant 
technology may be purified by standard techniques well 
known to those of skill in the art. Recombinantly 
produced viral sequences can be directly expressed or 
expressed as a fusion protein. The protein is then 
purified by a combination of cell lysis (e.g., 
sonication) and affinity chromatography. For fusion 
products, subsequent digestion of the fusion protein 
with an appropriate proteolytic enzyme releases the 
desired peptide. 

The proteins may be purified to substantial purity by 
standard techniques well known in the art, including 
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selective precipitation with such substances as 
ammonium sulfate, column chromatography, 
immunopurification methods, and others. See, for 
instance. Scopes, R. [84], incorporated herein by 

5 reference . 

B. ^nin^icsl rpsts fcr the creser.ee cf 

ant- ibodies to the human herpesvirus. 
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This invention further embraces diagnostic kits for 
detecting the presence of a KS agent in biological 
samples, such as serum or solid tissue samples, 
comprising a container containing antibodies to the 
human herpesvirus, and instructional material for 
performing the test. Alternatively, inactivated viral 
particles or peptides or viral proteins derived from 
the human herpesvirus may be used in a diagnostic kit 
to detect for antibodies specific to the KS associated 
human herpesvirus. 



Diagnostic kits for detecting the presence of a KS 
agent in tissue samples, such as skin samples or 
samples of . other affected tissue, comprising a 
container containing a nucleic acid sequence specific 

25 for the human herpesvirus and instructional material 

fcr detecting the KS-associated herpesvirus are also 
included. A container containing nucleic acid primers 
to any one of such sequences is optionally included as 
are antibodies to the human herpesvirus as described 

3 0 here'in. 

Antibodies reactive with antigens of the human 
herpesvirus can also be measured by a variety of 
immunoassay methods that are similar to the procedures 
described above for measurement of antigens. For a 
review of immunological and immunoassay procedures 
aoolicable to the measurement of antibodies by 
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immunoassay techniques, see Basic and Clinical 
Immunology 7th Edition [12], and [32], supra. 

Ir. brief, immunoassays to measure antibodies reactive 
5 with antigens of the KS-associated human herpesvirus 

car. be . either competitive or noncompetitive binding 
assays. In competitive binding assays, the samp.e 
anaiyte competes with a labeled analyte for speciric 
binding sites on a capture agent bound tc a so.ic 

"0 surface. Preferably the capture agent is a puntiec 

recombinant human herpesvirus protein produced as 
described above. Other sources of human herpesvirus 
proteins, including isolated or partially purified 
naturally occurring protein, may also be used. 

15 Noncompetitive assays are typically sandwich assays, 

in which the sample analyte is bound between two 
analyte- specif ic binding reagents. One of the binding 
agents is used as a capture agent and is bound to a 
solid surface. The second binding agent is labelled 

2 0 and is used to measure or detect the resultant complex 

bv visual or instrument means. A number of 

combinations of capture agent and labelled binding 
agent can be used. A variety of different immunoassay 
formats, separation techniques and labels can be also 
25 be used similar to those described above fcr the 

measurement of the human herpesvirus antigens. 

Hemagglutination Inhibition (KI) and Complement 
Fixation (CF) which are two laboratory tests that can 

3 0 be used to detect infection with human herpesvirus by 

testing for the presence of antibodies against the 
virus or antigens of the virus. 

Serological methods can be also be useful when one 
35 wishes to detect antibody to a specific variant. For 

example, one may wish to see how well a vaccine 
recipient has responded to the new variant. 
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Alternatively, one may take serum from a paner.: tc 
see which variant the patient responds to the best. 

This invention provides an antagonist capable or 
blocking the expression of the peptide or polypeptide 
encoded by the isolated DNA molecule. In one 

embodiment the antagonist is capable of hybridizing 
with a double stranded DNA molecule. In another 
embodiment the antagonist is a triplex oligonucleotide 
capable of hybridizing to the DNA molecule. In 
another embodiment the triplex oligonucleotide is 
capable of binding to at least a portion of the 
isolated DNA molecule with a nucleotide sequence as 
shown in Figure 3A-3F (SEQ ID NOs : 1-37). 

This invention provides an antisense molecule capable 
of hybridizing to the isolated DNA molecule. In one 
embodiment the antisense molecule is DNA. In another 
embodiment the antisense molecule is RNA. 



The antisense molecule may be DNA or RNA or variants 
r hereof (i.e. DNA or RNA with a protein backbone). 
The present invention extends to the preparation of 
antisense nucleotides and ribozymes that may be used 
25 to interfere with the expression of the receptor 

recognition proteins at the translation of a specific 
mRNA, either by masking that MRNA with an antisense 
nucleic acid or cleaving it with a ribozyme . 

3 0 Antisense nucleic acids are DNA or RNA molecules that 

are complementary to at least a portion of a specific 
MRNA molecule. In the ceil, they hybridize to that 
MRNA, forming a double stranded molecule. The cell 
does 'not translate an MRNA in this double - stranded 
form. Therefore, antisense nucleic acids interfere 
with the expression of MRNA into protein. Oligomers 
o* about fifteen nucleotides and molecules that 
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hybridize to the AUG initiation codon are particularly 
efficient, since they are easy to synthesize and are 
likely to pose fewer problems than larger molecules 
upon introduction to cells. 

This invention provides a transgenic nor.huir.ar. mamma- 
which comprises at least a portion of the isolated DNA 
molecule introduced into the mammal at an emc-rycmc 
stage- Methods of producing a transgenic nonhuman 
10 mammal are known to those skilled in the art. 

This invention provides a ceil line containing the 
isolated KS associated herpesvirus cf the subject 
invention. In one embodiment the isolated DNA 

15 molecule is artificially introduced into the cell. 

Cell lines include, but are not limited to: 
fibroblasts, such as HFF, NIH/3T3; Epithelial cells, 
such as 5637; lymphocytes, such as FC3 ; T-ceils, such 
as CCRF-CEM (ATCC CCL 119); B-cells, such as B JAB and 
22 Ra]i (ATCC CCL 86); and myeloid cells such as K562 

(ATCC CCL 243); Vero cells and carcinoma cells. 
Methods of producing such ceil lines are known to 
those skilled in the art. In one embodiment the 
isolated KS associated herpesvirus is introduced into 
25 a RCC-1 ceil line. 

Ii: in vitro riiamos:^ assavF the d~~~ction of 

KS 
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This invention provides a method of diagnosing 
Kaposi's sarcoma in a subject which comprises: (a) 
obtaining a nucleic acid molecule from a tumor lesion 
of the subject.- (b) contacting the nucleic acid 
molecule with a labelled nucleic acid molecule of at 
least 15 nucleotides capable of specifically 
hybridizing with the isolated DNA. under hybridizing 
conditions; and (c) determining the presence of tne 
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nucleic acid molecule hybridized, the presence of 
which is indicative of Kaposi's sarcoma ir. the 
subject, thereby diagnosing Kaposi's sarcoma in the 
subj ect . 



_ u 



15 



in 



one embodiment the DNA molecule from the tumor 
lesion is amplified before step (b) . In another 
embodiment PCR is employed to amplify the nucleic ac:c 
molecule. Methods of amplifying nucleic acid 

molecules are known to those skilled in the art . 

A person of ordinary skill in the art will be able to 
obtain appropriate DNA sample for diagnosing Kaposi's 
sarcoma in the subject. The DNA sample obtained by 
the above described method may be cleaved by 
restriction enzyme. The uses of restriction enzymes 
to cleave DNA and the conditions to perform such 
cleavage are well-known in the art. 



30 



20 

In the above described methods, a size fractionation 
may be employed which is effected by a polyacrylamide 
gel. In one embodiment, the size fractionation is 
effected by an agarose gel. Further, transferring the 
DNA fragments into a solid matrix may be employed 
before a hybridization step. One example cf such 
solid matrix is nitrocellulose paper. 



This invention provides a method of diagnosing 
Kaposi's sarcoma in a subject which comprises: (a) 
obtaining a nucleic acid molecule from a suitable 
bodily fluid of the subject; (b) contacting the 
nucleic acid molecule .with a labelled nucleic acid 
molecules of at least 15 nucleotides capable of 
specifically hybridizing with the isolated DNA, under 
hybridizing conditions; and <c> determining the 
presence of the nucleic acid molecule hybridized, the 
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presence of which is indicative of Kaposi's sarcoma in 
the subject, thereby diagnosing Kaposi's sarcoma ir. 
the subject. 

5 This invention provides a method cf diagnosing a DNA 

virus in a subject, which comprises (a) obtaining a 
suitable bodily fluid sample from the subject, (b; 
contacting the suitable bodily fluid of the subject tc 
a support having already bound thereto a Kaposi's 
10 sarcoma antibody, so as to bind the Kaposi's sarcoma 

antibody to a specific Kaposi's sarcoma antigen, (c) 
removing unbound bodily fluid from the support, and 
(d) determining the level of Kaposi's sarcoma antibody 
bound by the Kaposi's sarcoma antigen, thereby 
diagnosing the subject for Kaposi's sarcoma. 



15 



This invention provides a method of diagnosing 
Kanosi's sarcoma in a subject, which comprises (a) 
obtaining a suitable bodily fluid sample from the 
20 subject, (b) contacting the suitable bodily fluid of 

the subject to a support having already bound thereto 
a Kaposi's sarcoma antigen, so as to bind Kaposi's 
sarcoma antigen to a specific Kaposi's sarcoma 
antibody, (c) removing unbound bodily fluid from the 
suooort, and (d) determining the level of the Kaposi's 
sarcoma antigen bound by the Kaposi's sarcoma 
antibody, thereby diagnosing Kaposi's sarcoma. 



3 0 This invention provides a method of detecting 

expression of a DNA virus associated with Kaposi's 
sa-coma in a cell which comprises obtaining total cDNA 
obtained from the cell, contacting the cDNA so 
obtained with a labelled DNA molecule unaer 

75 hvb-idizing conditions, determining the presence of 
-DNA hvbridized to the molecule, and thereby detecting 
t»e expression of the DNA virus. In one embodiment 
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mRNA is obtained from the cell to detect expression cf 
the DNA virus. 

The suitable bodily fluid sample is any bodily fluic 
sample which would contain Kaposi's sarcoma antibody, 
antigen or fragments thereof. A suitable bodily fluic 
includes, but is not limited to: serum, plasma, 
cerebrospinal fluid, lymphocytes, urine, transudates, 
or exudates. In the preferred embodiment, the 
suitable bodily fluid sample is serum or plasma. In 
addition, the bodily fluid sample may be cells from 
bone marrow, or a supernatant from a cell culture . 
Methods of obtaining a suitable bodily fluid sample 
from a subject are known to those skilled in the art. 
Methods of determining the level of antibody or 
antigen include, but are not limited to: ELISA, I FA, 
and Western blotting. Other methods are known to 
those skilled in the art. Further, a subject infected 
with a DNA virus associated with Kaposi's sarcoma may 
be diagnosed with the above described methods. 



The detection of the human herpesvirus and the 
detection of virus-associated KS are essentially 
identical processes. The basic principle is to detect 
25 the virus using specific ligands that bind to the 

virus but not to other proteins or nucleic acids in a 
normal human cell or its environs. The ligands can 
either be nucleic acid or antibodies. The ligands can 
be naturally occurring or genetically or physically 

3D modified such as nucleic acids with non-natural or 

antibody derivatives, i.e., Fab or chimeric 
antibodies. Serological tests for detection cf 
antibodies to the virus may also be performed by using 
protein antigens obtained from the human herpesvirus, 

35 and described herein. 
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Samples can be taken from patients with KS cr rrom 
patients at risk for KS , such as AIDS patients . 
Typically the samples are taken from blood (cells, 
serum and/cr plasma) or from solid tissue samples such 
as skin lesions. The most accurate diagnosis for KS 
will occur if elevated titers of the virus are 
detected in the blood or in involved lesions. KS may 
also be indicated if antibodies to the virus are 
detected and if ether diagnostic factors rcr hi is 
present . 

A. Nucleic acid assays. 

The diagnostic assays of the invention can be nucleic 
acid assays such as nucleic acid hybridization assays 
and assays which detect amplification of specific 
nucleic acid to detect for a nucleic acid sequence of 
the human herpesvirus described herein. 

Accepted means for conducting hybridization assays are 
known and general overviews of the technology can be 
had from a review of: Nucleic Acid Hybridization ; A 
Practical Approach [72] ; Hybridization of Nucleic 
Acids Immobilized on Solid Supports [41] ; Analytical 
Biochemistry [4] and Innis et al . , PCR Protocols [74], 
supra, all of which are incorporated by reference 



herein . 



If PCR is used in conjunction with nucleic acid 
hybridization, primers are designed to target a 
specific portion of the nucleic acid of the 
herpesvirus. Per example, the primers set fcrtn m 
SE2 ID NOs: 36-40 may be used to target detection of 
regions of the herpesvirus genome encoding ORF 25 
homologue - ORF 32 homologue . From the information 
provided herein, those of skill in the art will be 
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to select appropriate specific primers. 
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Targe! _ specific probes may be used in the nucleic acid 
hybridization diagnostic assays for KS . The probes 
are specific for or complementary to the target of 
interest. For precise allelic differentiations, the 
5 probes should oe about 14 nucleotides long and 

preferably about 20-30 nucleotides. For more general 
detection of the human herpesvirus of the invention, 
nucleic acid probes are about 50 to about 100C 
nucleotides, most preferably about 200 to about 4 00 
10 nucleotides. 

A sequence is "specific" for a target organism of 
inrerest . if it includes a nucleic acid sequence which 
when detected is determinative of the presence of the 
15 organism in the presence of a heterogeneous population 

of proteins and other biologies. A specific nucleic 
acid probe is targeted to that portion of the sequence 
which is determinative of the organism and will not 
hybridize to other sequences especially those of the 

2 0 host where a pathogen is being detected. 

The specific nucleic acid probe can be RNA or DNA 
polynucleotide or oligonucleotide, or their analogs. 
The probes may be single or double stranded 
25 nucleotides. The probes of the invention may be 

synthesized enzymatically , using methods well known in 
the art (e.g., nick translation, primer extension, 
reverse transcription, the polymerase chain reaction, 
and others) or chemically (e.g., by methods such as 

3 0 the phosphoramidite method described by 3eaucage and 

Carruthers [19], or . by the triester method according 
to Matteucci, et al . [62] , both incorporated herein by 
reference) . 



35 



The probe must be of sufficient length to be able to 
form* a stable duplex with its target nucleic acid in 
the samole, i.e., at least about 14 nucleotides, and 
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may be longer (e.g., at least about 50 or 100 bases in 
iensth) . Often the probe will be more than about ±00 
bases in length. For example, when probe is prepared 
by nick-translation of DNA in the presence of labeled 
= nucleotides the average probe length may be about 100- 

600 bases. 

As noted above, the probe will be capable cf specific 
hybridization to a specific KS-associated herpes virus 
-C nucleic acid. Such "specific hybridization" occurs 

when a probe hybridizes to a target nucleic acid, as 
evidenced by a detectable signal, under conditions in 
which the probe does not hybridize to other nucleic 
acids (e.g., animal cell or other bacterial nucleic 
15 acids) present in the sample. A variety cf factors 

including the length and base composition of the 
probe, the extent of base mismatching between the 
Probe and the target nucleic acid, the presence of 
salt and organic solvents, probe concentration, and 
2-* -he temperature affect hybridization, and optimal 

hybridization conditions must often be determined 
empirically. For discussions of nucleic acid probe 
design and annealing conditions, see. fcr example, 
[61], supra, Ausubel, F . , et a2 . [6] [hereinafter 
2 5 referred to as Sambrook] , Methods in Enzymoiogy [67^ 

cr Hybridization with Nucleic Acid Probes [42] all of 
which are incorporated herein by reference. 
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Usually, at least a part of the probe will ..ave 
considerable sequence identity with the target r.ucieic 
a-d Although the extent of the sequence laer.tity 
^a-"d for soecific hybridization will depend on the 
-,»; a -.h of the orobe and the hybridization conditions, 
^ probe will usually have at least 70% identity to 
-h- target nucleic acid, more usually at least 80* 
identity, still more usually at least 90% identity and 
most usually at least 95% or 100% identity. 
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A probe can be identified as capable of hybridizing 
specifically to its target nucleic acid by hybridizing 
the probe to a sample treated according the protocol 
of this invention where the sample contains both 
5 taraet virus and animal cells (e.g., nerve ceils) . A 

probe is specific if the probe's characteristic signal 
is associated with the herpesvirus DNA in the sample 
and not generally with the DNA of the host cells and 
non-biological materials (e.g., substrate) in a 
10 sample. 

The following stringent hybridization and washing 
conditions will be adequate to distinguish a specific- 
probe (e.g., a f luorescently labeled DNA probe) from' 
15 a probe that is not specific: incubation of the probe 

with the sample for 12 hours at 37°C in a solution 
containing denatured probe, 50% formamide, 2X SSC, and 
0.1% (w/v) dextran sulfate, followed by washing in IX 
SSC at 70°C for 5 minutes; 2X SSC at 37°C for 5 
2C minutes; 0 . 2X SSC at room temperature for 5 minutes, 

and H = 0 at room temperature for 5 minutes. Those of 
skill will be aware that it will often be advantageous 
ir. nucleic acid hybridizations (i.e., in situ, 
Southern, or other) to include detergents (e.g., 
25 sodium dodecyl sulfate), chelating agents (e.g., EDTA) 

cr ether reagents (e.g., buffers, Denhardt ' s solution, 
dextran sulfate) in the hybridization or wash 
solutions. To test the specificity of the virus 
specific probes, the probes can be tested on host 
30 ceils containing the KS-associated herpesvirus and 

compared with the results from cells containing non- 
KS-associated virus. 

It will be apparent to those of ordinary skill in the 
3 5 art that a convenient method for determining whether 

a orobe is specific for a KS-associated viral nucleic 
acid utilizes a Southern blot (or Dot blot) using DNA 
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prepared from one or more KS- associated human 
herpesviruses of the invention. Briefly, to identity 
a target specific probe DNA is isolated from the 
virus. Test DNA either viral or cellular is 

transferred to a solid (e.g., charged nylon) matrix. 
The probes are labelled following conventional 
methods. Following denaturation and/or 

r-rehybridization steps known in the art, the .probe is 
hybridized to the immobilized DNAs under stringent 
conditions. Stringent hybridization conditions wii- 
depend on the probe used and can be estimated from tne 
calculated (melting temperature) of the hybridized 
probe (see, e.g., Sambrook for a description of 
calculation of the TJ . For radioactively- labeled DNA 
or RNA probes an example of stringent hybridization 
conditions is hybridization in a solution containing 
denatured probe and 5x SSC at 65°C for 8-24 hours 
followed by washes in 0 . Ix SSC, 0.1% SDS (sodium 
dodecyl sulfate) at 50-65°C. In general, the 
temperature and salt concentration are chosen so that 
the post hybridization wash occurs at a temperature 
that is about 5°C below the T M of the hybrid. Thus for 
a particular salt concentration the temperature may be 
selected that is 5°C below the T M or conversely, for a 
25 particular temperature, the salt concentration is 

chosen to provide a T K for the hybrid that is 5°C 
warmer than the wash temperature. Following stringent 
hybridization and washing, a probe that hybridizes to 
the KS-associated viral DNA but not to the non-KS 
3 0 associated viral DNA, as evidenced by the presence of 

a signal associated with the appropriate target and 
-he absence of a signal from the non-target nucleic 
acids, is identified as specific for the KS associated 
virus. It is further appreciated that in determining 
2 5 probe specificity and in utilizing the method cf this 

invention to detect KS-associated herpesvirus, a 
certain amount of background signal is typical and can 
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easily be distinguished by one of skill from a 
specific signal. Two fold signal over background is 
acceptable . 

- A pre f er red method for detecting the KS- associated 

herpesvirus is the use of PCR and/or dec blot 
hybridization. The presence or absence of an KS agent 
for detection or prognosis, or risk assessment for KS 
includes Southern transfers, solution hybridization or 
10 non-radioactive detection systems, all cf whicn are 

well known to those of skill in the art. 
Hybridization is carried out using probes. 
Visualization of the hybridized portions allows the 
qualitative determination of the presence or absence 
of the causal agent . 



Id 



Similarly, a Northern transfer may be used for the 
detection of message in samples of RNA or reverse 
transcriptase PCR and cDNA can be detected by methods 
2C described above. This procedure is also well known in 

the art. See [81] incorporated by reference herein. 

An alternative means for determining the presence of 
the human herpesvirus is in situ hybridization, or 
25 more recently, in situ polymerase chain reaction. In 

si~v PCR is described in Neuvo en ai . [71], 
Intracellular localization of polymerase chain 
reaction ( PCR) -amplified Hepatitis C cDNA; Bagasra et 
al . [1C] , Detection of Human Immunodeficiency virus 
type 1 provirus in mononuclear cells by in situ 
Doivmerase chain reaction; and Heniford ec al . [35], 
Variation in cellular EGF receptor mRNA expression 
demonstrated by in situ reverse transcriptase 
polymerase chain reaction. In sjtu hybridization 
assays are well known and are generally described in 
Methods Enzymol. [SI] incorporated by reference 
herein. In an in situ hybridization, ceils are fixed 
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to a solid support, typically a class slide. The 
cells are then contacted with a hybridization solution 
at a moderate temperature to permit annealing of 
taraet-specif ic probes that are labelled. The probes 
5 are preferably labelled with radioisotopes or 

fluorescent reporters. 

The above described probes are also useful rcr m -situ 
hybridization or in order to locate tissues which 
express this gene, or for other hybridization assays 
for the presence of this gene or its MRNA in various 
biological tissues. In-situ hybridization is a 
sensitive localization method which is not dependent 
on expression of antigens or native vs. denatured 
conditions . 
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Oligonucleotide (oligo) probes, synthetic 

oligonucleotide probes or riboprobes made from KSHV 
phagemids/plasmids, are relatively homogeneous 
reagents and successful hybridization conditions in 
-issue sections is readily transferable from one probe 
zo another. Commercially synthesized oligonucleotide 
probes are prepared against the identified genes. 
These probes are chosen for length (45-65 mers) , high 
G-C content (50-70%) and are screened for uniqueness 
a ~-^ nsz other viral sequences in GenBank . 

Oligonucleotides are 3 ' end- labeled with [a-"S]dAT? to 
specific activities in the range of 1 x 1C 1: dpm/ug 
using terminal deoxynucieot idyl transferase. 
Unincoroorated labeled nucleotides are removed from 
the oligo probe by centrif ugat ion through a Sephadex 
G - 2 5 column or by e 
column . 



lution from a Waters Sep FaK C-1B 



KS tissue embedded in OCT compound and snap frozen m 
freezing isopentane cooled with dry ice is cut at 6 hit. 
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intervals and thawed onto 3 -aminopropyltriethoxysilane 
treated slides and allowed to air dry. The slides are 
then be fixed in 4% freshly prepared paraformaldehyde, 
rinsed in water. Formalin- fixed, paraffin embedded KS 
-issues cut at 6 and baked onto glass slides can 

also be used. The sections are then deparaf f ir.ized m 
xvler.es and rehydrated through graded alcohols. 
Prehybridization in 2 0mM Tris Ph 7.5, 0.02% Denhardt ' s 
solution, 10% dextran sulfate for 30 min at 37°C is 
followed by hybridization overnight in a solution or 
50% formamide (v/v) , 10% dextran sulfate (w/v) , 2 0mM 
sodium phosphate (Ph 7.4), 3X SSC, IX Denhardt ' s 
solution, 100 ug/ml salmon sperm DNA, 125 ug/ml yeast 
tRNA and the oiigo probe (lO^pm/ml) at 42°C overnight. 
The slides are washed twice with 2X SSC and twice with 
IX SSC for 15 minutes each at room temperature and 
visualized by autoradiography. Briefly, sections are 
dehydrated through graded alcohols containing 0.3M 
ammonium acetate and air dried. The slides are dipped 
in Kodak NTB2 emulsion, exposed for days to weeks, 
developed, and counterstained with hematoxylin and 
eoxin. Alternative immunohistochemical protocols may 
be employed which are known to those skilled in the 
art . 

IV . ^gg-mer- of human herp esvirus - i nduced KS 

This invention provides a method of treating a subject 
with Kaposi's sarcoma, comprising administering to the 
subject an effective amount of the antisense molecule 
caoable of hybridizing to the isolated DNA molecule 
under conditions such that the antisense molecule 
selectively enters a tumor cell of the subject, so as 
to treat the subject. 
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This invention provides a method for treating a 
subject with Kaposi's sarcoma (KS) comprising 
administering to the subject having a human 
herpesvirus -associated KS a pharmaceut ically effective 
amount cf an antiviral agent in a pharmaceutical iy 
acceptable carrier, wherein the agent is effective tc 
treat the subject with KS- associated human herpes 
virus . 

Further, this invention provides a method or 
prophylaxis or treatment for Kaposi's sarcoma (KS) by 
administering to a patient at risk for KS, an antibody 
that binds to the human herpesvirus in a 
pharmaceut ically acceptable carrier. In one 

embodiment the antiviral drug is used to treat a 
subject with the DNA herpesvirus of the subject 
invention. 

The use of combinations of antiviral drugs and 
sequential treatments are useful for treatment of 
herpesvirus infections and will also be useful for the 
treatment of herpesvirus- induced KS . For example, 
Snoeck e: al . [88], found additive or synergistic 
effects against CMV when combining antiherpes drugs 
(e.g., combinations of zidovudine [ 3 ' - azido- 3 ' - 
deoxythymidine, AZT] with HPMPC, ganciclovir, 
foscarnet or acyclovir or of KPMPC with other 
antivirals) . Similarly, in. treatment of 

cytomegalovirus retinitis, induction with ganciclovir 
followed by maintenance with foscarnet has been 
suggested as a way to maximize efficacy while 
minimizing the adverse side effects of either 
treatment alone. An ant i -herpetic composition that 
contains acyclovir and, e.g. , 2 - acetylpyridine - 5 - ( (2- 
pyridylamino) thiocarbonyl } - thiocarbonohydrazone is 
described in U.S. Pat. 5,175,165 (assigned to 
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Burrouahs Wellcome Co.). Combinations cf TS- 

inhibitors and viral TK- inhibitors in antiherpecic 
medicines are disclosed in U.S. Par. 5,137,724, 
assigned to Stichting Rega VZK . A synergistic 
inhibitory effect on EBV replication using certain 
ra - ios C f combinations of HPMPC with AZT was repcrted 
ev Lin et al . [56] . 
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U.S. Patent Nos. 5,164,395 and 5,021,437 (Biumenkcpf ; 
Burroughs Wellcome) describe the use c: 
ribonucleotide reductase inhibitor (an acetylpyridine 
derivative) for treatment of herpes infections, 
including the use of the acetylpyridine derivative in 
combination with acyclovir. U.S. Patent No. 5,137,724 
(Baizari et al . [11]) describes the use of thymilydate 
synthase inhibitors {e.g., 5 - f luoro-uracil and 5- 
f luro-2' -deoxyuridine) in combination with compounds 
having viral thymidine kinase inhibiting activity. 

With the discovery of a disease causal agent for KS 
now identified, effective therapeutic or prcphalactic 
protocols to alleviate or prevent the symptoms of 
herpes virus-associated KS can be formulated. Due to 
zhe viral nature of the disease, antiviral agents have 
application here for treatment, such as interferons, 
nucleoside analogues, ribavirin, amantadine, and 
pyrophosphate analogues of phosphonoacet ic acid 
(foscarnet) (reviewed in Gorbach, S.L., et al . [28]) 
and the like. Immunological therapy will also be 
effective in many cases to manage and alleviate 
symptoms caused by the disease agents described here. 
Antiviral agents include agents cr compositions that 
directly bind to viral products and interfere with 
disease progress; and, excludes agents that do not 
impact directly on viral multiplication cr viral 
titer. Antiviral agents do not include 

immuncregulatory agents that do not directly affect 
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viral titer cr bind to viral products. Antivira. 
agents are effective if they inactivate the virus, 
otherwise inhibit its infectivity or mult ipl teat ion , 
alleviate the symptoms of KS . 

5 

A. Antiviral Agents. 

"he antiherpesvirus agents that will oe useru. zzr 
treating virus - induced KS can be grouped into broad 
10 classes oased on their presumed modes cf action. 

These classes include agents that act n) by 
inhibition of viral DNA polymerase, (ii> by targeting 
other viral enzymes and proteins, (m) by 
miscellaneous or incompletely understood mechanisms, 
or (iv) by binding a target nucleic acid' {i.e., 
inhibitory nucleic acid therapeutics) . Antiviral 
agents may also be used in combination {i.e., together 
or sequentially) to achieve synergistic or additive 
effects or other benefits. 

Although it is convenient to group antiviral agents by 
their supposed mechanism of action, the applicants do 
not intend to be bound by any particular mechanism of 
antiviral action. Moreover, it will be understood by 
those cf skill that an agent may act on more than one 
-ar.get in a virus or virus - infected ceil cr through 
mere uhan one mechanism. 

i) Inhibitors of viral DNA polymerase 

Many antiherpesvirus agents in clinical use cr in 
development today are nucleoside analogs believed to 
act throuah inhibition of viral DNA replication, 
especially through inhibition of viral DNA polymerase . 
These nucleoside analogs act as alternative substrates 
for the viral DNA polymerase or as competitive 
inhibitors of DNA polymerase substrates. Usually 



30 



PCTVUS95/10194 

WO 96/06159 

60 

these agents are preferentially phosphorylated by 
viral thymidine kinase (TK) , if one is present , and/cr 
have higher affinity for viral DNA polymerase than for 
the cellular DNA polymerases, resulting in selective 
5 antiviral activity. Where a nucleoside analogue is 

incorporated into the viral DNA, viral activity or 
reproduction may be affected in a variety of ways. 
?or example, the analogue may act as a chair, 
terminator, cause increased lability (e.g., 
1C susceptibility to breakage) of analogue- containing 

DNA, and/or impair the ability of the substituted DNA 
to act as template for transcription or replication 
(see, e.g., Balzarini et a.1 . [11]). 



It will be known to one of skill that, like many 
drugs, many of the agents useful for treatment of 
herpes virus infections are modified (i.e., 
••activated") by the host, host cell, or virus-infected 
host cell metabolic enzymes. For example, acyclovir 
is triphosphorylated to its active form, with the 
first phosphorylation being carried out by the herpes 
virus "thymidine kinase, when present. Other examples 
are the reported conversion of the compound HOE 602 to 
ganciclovir in a three-step metabolic pathway (Winkler 
et al. [95]) and the phosphorylation of ganciclovir to 
"ts active form by, e.g., a CMV nucleotide kinase. It 
will be apparent to one of skill that the specific 
metabolic capabilities of a virus can affect the 
sensitivity of that virus to specific drugs, and is 
on- factor in the choice of an antiviral drug. The 
rr-chanism of action of certain ant i -herpesvirus agents 
-s discussed in De Clercq [22] and in other references 
cited supra and infra, all of which are incorporated 
by reference herein. 

Anti -herpesvirus medications suitable for treating 
viral induced KS include, but are not limited to. 
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nucleoside analogs including acyclic nucleoside 
or. osphonate analogs ( e . 9 ■ - 
"ohosphonylmethoxyalkyipurines and -pyridines; , ana 
"-yc'ic nucleoside analogs. These include drugs sucr. 
a's • v< aarabine (9-p-D-arabinofuranosyiadenine ; adenine 
arabinoside, ara-A, Vira-A, Parke-Davis) ; ^ 1-fi-D- 
arabinof uranosyluracil i « r a - U ) ; ^-t---- 
arabincfuranosyl-cytosine (ara-C); KPMPC L ,s'---i-- 
:-ydroxy-2-(pbosphonylme-hoxyipropyl]cy-osine (e.g., GS 

10 504 Giiead Science)] and its cyclic form (cKPKPC); 

H ? M P A [ ( S ) - 9 - ( 3 - n y d r => x y -2 - 
ohosDhcnylmethoxypropyl) adenine] ana its cyclic fonr, 
"'. cHPMFA ) ; (S)-HPMPDAP [ < S ) - S - I 3 - hydroxy - 2 - 

ohosphonylmethoxypropyl) -2 , 6-diaminopurinej ; PKEDAP 
"[9- (2-Dhosphonyl-methoxyethyl) -2 , 6 -diatr.inopunne] ; HOE 
6 0 2 * [2 -amino- 9- ( 1 , 3 -bis ( isopropoxy) - 2 - 
proDoxynethyllpurine] ; ? K E A [ 9 - ( 2 - 
phosphonylmethoxyethyl) adenine] ; bromovinyl- 
deoxyuridine (Burns and Sandford. [21]); 1-/3-D- 
arabincfuranosyl-E-5- (2-bromovinyl) -uridine or -2'- 
Qo-xyuridine; BVaraU ( 1 - 0-D-arabinof uranosyi - E- 5 - ( 2 - 
v-om'ovinyl) -uracil, brovavir. Bristol-Myers Squibb, 
vamsa Shoyu) ; BVDU [ (E) -5- (2-bromovinyl) -2' - 
deoxvuridine, brivudin. e.g.. Helpin] anc its 
carbocyclic analogue (in which the sugar moiety is 
replaced by a cyclopentane ring); IVDu [ (E) -5-^(2- 
iodovinyl) -2'-deoxyuridine] and its carbocyclic 
analogue, C-IVDU (Balzarini ec al . [II]) J; ana 5- 
mercutithio analogs of 2 ' -deoxyuridine (Hoiliday, J., 
, C and Williams, M.-V. [38]); acyclovir [S-U2- 

hvd-oxvethoxy] methyl) guanine; e.g., Zovirax (Burrougns 
Wellcome)] ; penciclovir (5- [ 4 - hydroxy - 2- 
(hvdroxymethyl) butyl] -guanine) ; ganciclovir [«S-il.3- 
dinvdroxy-2 propoxymethyl] -guanine ) e.g.. Cymevene , 
3= cytovene (Syntex) , DHPG (Stals ez al . !e9]j; 

isopropylether derivatives of ganciclovir (see, e.g., 
Win'icelmann et al . [94]); cygaiovir; famciclovir [2- 
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amino- 9- (4 -acetoxy-3 - (acetoxymethyl ) but -l-yl ) purine 
(Smithkline Beecham) ] ; valacyclovir (Burroughs 
Wellcome) ; desciclovir [ (2-amino- 9 - (2 - 
ethoxyme:hyl)purine)] and 2 - arr.ir.o- 9 - ( 2 - 
5 hydroxyethoxymethyl) - 9K-purine , prodrugs or 

acyclovir]; COG (carbocyclic 2 ' - deoxyguanosine ) ; and 
ourine nucleosides with the pentaf uranosyl ring 
replaced by a cyclo butane ring (e.g., cyclcbut-A [(-- 
) - 9- [1/5, 2a, 3 £ ) -2 , 3-bis (hydroxymethyl) - I - 
10 cyclobutyl] adenine] , cyclcbut-G [( + -)-S-[1£,2q,3/3)- 

2 , 3 -bis (hydroxymethyl) - 1- cyclobutyl] guanine] , BKCG 
[(R)-(la,2/5,la)-9-(2,3- 

bis (hydroxymethyl) cyclobutyl] guanine] , and an active 
isomer of racemic BHCG, SQ 34,514 [IK- la, 2£, 3a) -2- 

15 amino- 5- [2, 3 -bis (hydroxymethyl ) cyclobutyl] -€H-purin-6- 

one (see, Braitman et al.(1991) [20]]. Certain of 
these antiherpesviral agents are discussed in Gorach 
et al. [26]; Saunders et al . [82]; Yamanaka et al . , 
[95 ] ; Greenspan et al . [29], all of which are 

2C incorporated by reference herein. 

Triciribine and triciribine monophosphate are potent 
inhibitors against herpes viruses. (Ickes et al . [43], 
incorporated by reference herein), HIV- 1 and HIV-2 

25 CKucera en al . [51], incorporated by reference herein) 

and are additional nucleoside analogs that may be used 
zo treat KS . An exemplary protocol for these agents 
is an intravenous injection of about 0.3 5 mg/meter : 
(0.7 mg/kg) once weekly or every other week for at 

3 0 least two doses, preferably up to about four to eight 

weeks . 

Acyclovir and ganciclovir are of interest because of 
their accepted use in clinical settings. Acyclovir, 
3 5 an acyclic analogue of guanine, is phosphorylated by 

a herpesvirus thymidine kinase and undergoes further 
phosphorylation to be incorporated as a chain 
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terminator by the viral DNA polymerase during viral 
reolication. It has therapeutic activity against a 
broad range of herpesviruses, Herpes simplex Types 1 
and 2, Varicella- Zoster, cytomegalovirus, ar.~ 
5 Epstein-Barr Virus, and is used to treat disease such 

as herpes encephalitis, neonatal herpesvirus 
infections, chickenpox in immunocompromised hosts, 
herpes zoster recurrences, CMV retinitis, E3Y 
infections, chronic fatigue syndrome, and hairy 

10 leukoplakia in AIDS patients. Exemplary intravenous 

dosages or oral dosages are 250 mg/kg/nr body surface 
area, every 8 hours for 7 days, or maintenance doses 
of 200-400 mg IV or orally twice a day to suppress 
recurrence. Ganciclovir has been shown to be more- 

15 active than acyclovir against some herpesviruses. See, 

e.g., Oren and Soble [73]. Treatment protocols for 
ganciclovir are 5 mg/kg twice a day IV or 2 . 5 mg/kg 
three times a day for 10-14 days. Maintenance doses 
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are 5-6 mg/kg for 5-7 days. 



Also of interest is HPMPC. HPMPC is reported to be 
more active than either acyclovir or ganciclovir in 
the chemotherapy and prophylaxis of various KSV-1, 
KSV-2, TK- HSV, VZV or CMV infections in animal models 
25 { [22] , supra) . 

Nucleoside analogs such as BVaraU are potent 
inhibitors of HSV-1, EBV, and VZV that have greater 
Activity than acyclovir in animal models of 
encephalitis. FIAC ( f luroidoarbinosyl cytosine) and 
its related fluroethyl and iodo compounds (e.g., FEAU, 
ttt^u) have do tent selective activity against 
herpesviruses, and HPMPA ( (S) -I- ( [3 -hydroxy- 2 - 
phosphoryimethoxy] propyl) adenine) has oeen 
demonstrated to be more potent against HSV and CMV 
than acvclovir or ganciclovir and are of choice m 
advanced cases of KS . Ciadribine (2- 
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chlcrodeoxyadenosine) is another nucleoside analogue 
known as a highly specific antilymphocyte agent (i.e., 
a immunosuppressive drug) . 

Other useful antiviral agents include: 5 - thier. - 2 -yi - 
2 ' -deoxynridine derivatives, e.g., ETDU I - - ~ ( 5 - 
bromothien-2 -yl ) -2 ' -decxyuridine] and CTD'J [d- iS- 
chlcrcthien-2 -yi ) -2 ' -decxyuridine] ; and 0X7- A [9-(2- 
deoxy-2-hydroxymethyl-^-D-erythro-oxetanosyi) adenine] 

and OXT-G [5- ( 2 -deoxy- 2 -hydroxymethyi -/3-D-erythro- 
oxetanosyl) guanine] . Although OXT-G is believed tc 
act by inhibiting viral DNA synthesis its mechanism of 
action has not yet been elucidated. These and other 
compounds are described in Andrei ez ai . [5] which is 
incorporated by reference herein. Additional 
antiviral purine derivatives useful in treating 
herpesvirus infections are disclosed in US Pat. 
5,108,994 (assigned to Beecham Group P.L.C.). 6- 
Methoxypurine arabinoside (ara-M; Burroughs Wellcome) 
is a potent inhibitor of varicella- zoster virus, and 
will be useful for treatment of KS . 

Certain thymidine analogs [e.g., idoxuridine (5-ido- 
2 ' -deoxyuridine) ] and trif lurothymidine ) have 
antiherpes viral activity, but due to their systemic 
toxicity, are largely used for topical herpesviral 
infections, including KSV stromal keratitis and 
uveitis, and are not preferred here unless ether 
ootions are ruled out. 

Other useful antiviral agents that have demonstrated 
antiherpes viral activity include foscarnet sodium 
(trisodium phosphonof ormate , PFA, Foscavir (Astra)) 
and phosphonoacetic acid (PAA) . Foscarnet is an 
inorganic pyrophosphate analogue that acts by 
comoetitively blocking the pyrophosphate -binding site 
of ~DNA oolymerase. These agents which block DNA 



PCT/US95/10194 

WO 96/06159 

€5 

Dolymerase directly without processing by viral 
thymidine kinase. Foscarner is reported to be less 
toxic than PAA. 
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ii) Agents that: target viral proteins ether 
than DNA polymerase or other 
f unct icr.s . 



viral 



Although applicants do not intend to be bound by a 
particular mechanism of antiviral action, the 
antiherpes-virus agents described above are believed 
to act through inhibition of viral DNA polymerase. 
However, viral replication requires not only the 
replication of the viral nucleic acid but also the 
production of viral proteins and other essential 
components. Accordingly, the present invention 
contemplates treatment of KS by the inhibition of 
viral proliferation by targeting viral proteins other 
than DNA polymerase (e.g.. by inhibition of their 
synthesis or activity, or destruction of viral 
proteins after their synthesis). For example, 
administration of agents that inhibit a viral serine 
protease, e.g., such as one important in development 
of the viral capsid will be useful in treatment of 
viral induced KS . 

Other viral enzyme targets include: OMP decarboxylase 
inhibitors (a target of, e.g., parazcf urir. ) , CT? 
synthetase inhibitors (targets of, e.g., 
cyclopentenylcytosine) , IMP dehydrogenase, 
ribonucleotide reductase (a target of/ e.g., carnoxyl- 
containing N-alkyldipeptides as described in U.S. 
Patent No. 5,110,799 (Tolman et al . . MerckH. 
thvmidine kinase (a target of, e.g., i-12- 
(hydroxymethyl) cy c 1 oa iky lmethy i ] - 5 - subs 1 1 tut e a 
-uracils and -guanines as described in, e.g.. U.S. 
patent Bos'. 4,863,927 and 4,782.052 (Tolman et al . ; 
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Merck)) as well as other enzymes. It will be apparent 
to one of ordinary skill in the art that there are 
additional viral proteins, both characterized and as 
yet to be discovered, that can serve as target for 
5 antiviral agents. 

iv) Other agents and modes of ar.tivira. 
action . 

1C Kutapressin is a liver derivative available froir. 

Schwarz Parma of Milwaukee, Wisconsin in an injectable 
form of 25 mg/ml . The recommended dosage for 
herpesviruses is from 200 to 25 mg/ml per day for an 
average adult of 150 pounds. 

Poly (I) Poly(C, 2 U) , an accepted antiviral drug known as 
Ampligen from HEM Pharmaceuticals of Rockviile, MD has 
been shown to inhibit herpesviruses and is another 
antiviral agent suitable for treating KS . Intravenous 
20 injection is the preferred route of administration. 

Dosages from about 100 to 600 mg/nr are administered 
:wc :c three times weekly to adults averaging 150 
rounds. It is best to administer at least 200 mg/nr 
r?er week. 

2 E 

Other antiviral agents reported to show activity 
aoainst herpes viruses (e.g., varicella zoster and 
herpes simplex) and will be useful for the treatment 
of "herpesvirus- induced KS include- mappicine ketone 
30 (SmithKline Beecham) ; Compounds A, 79296 and A, 73209 

(Abbott) for varicella zoster, and Compound BS2C87 
(Burroughs Wellcome) [see, The Pink Sheet 55(20) May 
17 , 1993] . 

— interferon is known inhibit replication of herpes 

viruses. See [73], supra. Interferon has known 
-ox-city problems and it is expected that second 
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generation derivatives will soon be available 
will retain interferon's antiviral properties but have 
reduced side affects. 

It is also contemplated that herpes virus - induced K = 
may be treated by administering a herpesvirus 
reactivating agent to induce reactivation cz tne 
latent virus. Preferably the reactivation is combined 
with simultaneous or sequential administration of an 
anti -herpesvirus agent. Controlled reactivation ever 
a short period of time or reactivation in tne presence 
of an antiviral agent is believed to minimize the 
adverse effects of certain herpesvirus infections 

,e.g.. as discussed in PCT Application WO S3/046S3). 
Reactivating agents include agents such as estrogen, 
phorbol esters, forskolin and /3-adrenergic blocking 

agents . 

Agents useful for treatment of herpesvirus infections 
and for treatment of herpesvirus - induced KS are 
described in numerous U.S. Patents. For example, 
sar.cicisvir is an example of a antiviral guanine 
acyclic nucleotide of the type described in US Patent 
Kos. 4,2=5,032 and 4,603,219. 

Acyclovir is an example of a class of antiviral purine 
derivatives, including 9-(2- 

hvdroxyethylmethyi) adenine, of the type described in 
u'.S. Pat. Nos. 4,287,186, 4 , 2 94 , 631. ana 4 , 195 , 574 . 

Brivudin is an example of an antiviral deoxyuricme 
derivative of the type described in US Patent No. 
4 , 424 , 211 - 



Vidarabine is an example of an antiviral purin* 
nucleoside of the type described in Britisn 
1,159,290. 
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3rovavir is an example of an antiviral deoxyuridin< 
derivative of the type described in US Patent Nos 
4,542,210 and 4,386,076. 



BHCG is an example of an antiviral 
nucleoside analogue of the type describee 
Nos. 5,153,252, 5,034,394 and 5,126,345. 



H?M?C is an example of an antiviral pnospr.ony. 
1C methoxyaikyl derivative with cf the :> ? e described an 

US Patent No. 5,142,051. 

CDG (Carbocyclic 2 ' -deoxyguanosine ) is an example of 
an antiviral carbocyclic nucleoside analogue of the 
15 type described in US Patent Nos. 4,542,255, 4,855,466, 

and 4 , 894 , 458 . 



Poscarnet is desc 



ribed in US Patent No. 4,33 9,445. 



35 



Trifiuridine and its corresponding ribonucleoside is 
described in US Patent No. 3,201,367. 

■ j . S - Patent No. 5,321,030 (Kaddurah-Daouk e: al . ; 
Arr.ira!- describes the use of creatine analogs as 
antiherpes viral agents. U.S. Patent No. 5,306,722 
{Kim e: al . ; Bristol -Meyers Squibb' describes 
thymidine kinase inhibitors useful for treating HSV 
infections and for inhibiting herpes thymidine kinase. 
Other antiherpesvirus compositions . are described m 
U.S. Patent Nos. 5,286,649 and 5,093,708 (Konishi ec 
al., Bristol-Meyers Squibb) and 5,175,165 (Blumenkopf 
ez al.; Burroughs Wellcome). U.S. Patent No. 

4,880,820 (Ashton et aJ . ; Merck) describes the 
antiherpes virus agent (S) -9- [ 2 , 3 - dihydroxy- 1 - 
propoxymethyl ) guanine . 
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U.S. Patent No. 4,705, 925 (Suhadolnik e: al - ; Research 
Corporation) describes a 3 ' -deoxyadenosme compound 
effective in inhibiting HSV and E3V . U.S. Patent No. 
4,365,076 (Kachida e: al . ; Yamasa Shoyu Kabushiki 
K a i s h a ) describes use er- 

as an 
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£) -5- !2-halogsnovinyl) -arabinsfuranoEv.ura::- 
ar.tiherpesvirus agent. C.S. Patent No. 

■;i.ieb e= al . ; 3ayer AKtier.geseils-h^^, ■— s 

chosohonohyarcxyacetic acid derivatives useful as 
antiherpes agents. U.S. Patent Nos . 4.093,71= and 
4,093,716 Cin en al • Research Corporation) descnoe 
5 • -amino- 5 ' - de oxy thymidine and 5-ioac-='- 
ar.ino-2' ,5' -dideoxycytidine as potent inhibitors of 
heroes sxmpiex virus. U.S. Patent No. 4,069.382 
(Baker e: al . ; Parke, Davis & Company) describes 
9- (5-o-Acyl-beta-D-arabinofuranosyl) adenine compounds 
useful as antiviral agents. U.S. Patent No. 3.927,216 
(Witkowski et al.) describes the use of 
l,2,4-triazole-3-carboxarr. ide and 
','2.4- -riazole- 3 - thioearboxamiae for inhibiting herpes 
virus infections. Patent No. 5,179,093 (Afonso et 
al., Schering! describes qumcl me - 2 . 4 -dione 
derivatives active against herpes simplex virus 1 and 
2, cvtomegaiovirus and Epstein Barr virus. 

vi Inhibitory nucleic acid therapeutics 

Also contemplated here are inhibitory nucleic acid 
therapeutics which can inhibit the activity^ of 
herpesviruses in patients with KS . Inhibitory nucleic 
acids may be single-stranded nucleic acids, whicn can 
specifically bind to a complementary nucleic acio 
sequence. By binding to the appropriate target 
sequence, an RNA-RNA, a DNA-DNA, or F-KA-DNA =u?iex^or 
-*-p-.ex is formed. These nucleic acids are otter, 
termed "antisense" because they are ^usually 
complementary to the sense or coding stranc or tne 
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gene, although recently approaches for use cf "sense" 
nucleic acids have also been developed. The terrr. 
"inhibitory nucleic acids" as used herein, refers to 
both "sense" and " antisense " nucleic acids. 



3y binding to the target nucleic acid, the ir.n:r::cry 
nucleic acid car. inhibit the function of the target 
nucleic acid. This could, for example, be a result c:. 
blocking DNA transcription, processing or poly (A) 

LO addition to mRNA, DNA replication, translation, or 

promoting inhibitory mechanisms of the cells, such as 
promoting RNA degradation. Inhibitory nucleic acid 
methods therefore encompass a number of different 
accroaches to altering expression of herpesvirus 

15 genes. These different types of inhibitory nucleic 

acid technology are described in Helene, C. and 
Toulme, J . [34], which is hereby incorporated by 
reference and is referred to hereinafter as "Helene 
and Toulme . '' 

In brief, inhibitory nucleic acid therapy approaches 
can be classified into those that target DNA 
sequences, those that target RNA sequences (including 
ere -mRNA and mRNA), those that target proteins (sense 
25 strand approaches), and those that cause cleavage or 

chemical modification of the target nucleic acids. 

Approaches targeting DNA fall into several categories. 
Nucleic acids can be designed to bind to the major 
3 0 groove of the duplex DNA to form a triple helical or 

"triplex" structure. Alternatively, inhibitory 

nucleic acids are designed to bind to regions of 
single stranded DNA resulting from the opening cf the 
duplex DNA during replication or transcription. See 
~s 5 Helene and Toulme. 
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More commonly, inhibitory nucleic acids are designed 
:o bind co mRNA or mRNA precursors. Inhibitory 
nucleic acids are used to prevent maturation of pre- 
mRNA. inhibitory nucleic acids may be designed to 
5 interfere with RNA processing, splicing or 

translation . 

-he inhibitory nucleic acids can be targeted to mRNA. 
In this approach, the inhibitory nucleic acids are 
10 designed to specifically block translation of the 
encoded nrotein. Using this approach, the inhibitory 
nucleic "acid can be used to selectively suppress 
certain cellular functions by inhibition of 
translation of mRNA encoding critical proteins. For 
examole, an inhibitory nucleic acid complementary to 
regions of c-myc mRNA inhibits c-myc protein 
expression in a human promyelocyt ic leukemia cell 
line, HL60, which overexpresses the c-myc proto- 
oncogene. See Wickstrom E.L., et al . [93] and 
Harel-Bellan, A., et al . [31A]. As described in 
Helens and Toulme , inhibitory nucleic acids targeting 
mRNA have been shown to work by several different 
mechanisms to inhibit translation of the encoded 
protein (s) . 

-he inhibitory nucleic acids introduced into the cell 
can also encompass the "sense" strand of the gene or 
mRNA to trap or compete for the enzymes or binding 
crotems involved in mRNA translation. See Helene and 
3 o Toulme . 

Lastly, the inhibitory nucleic acids can be used to 
induce' chemical mactivation or cleavage of the '- ar ^ 
genes or mRNA . Chemical inactivation can occur ey tne 
3 5 induction of crosslinks between the inhibitory nucleic 

acid and the target nucleic acid within the cell. 
Other chemical modifications of the target nucleic 
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acids induced by appropriately derivatized inhibitory 
nucleic acids may also be used. 

Cleavage, and therefore inactivar ion , cf the target 
5 nucleic acids may be effected by attaching a 

substituent to the inhibitory nucleic acid which car. 
be activated to induce cleavage reactions. The 
substituent can be one that affects either chemical, 
or enzymatic cleavage. Alternatively, cleavage can 
10 be induced by the use of ribozyTr.es or catalytic RNA. 

In this approach, the inhibitory nucleic acids would 
comprise either naturally occurring RNA (ribozymes) or 
svnthetic nucleic acids with catalytic activity. 
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The targeting of inhibitory nucleic acids to specific 
ceils of the immune system by conjugation with 
targeting moieties binding receptors on the surface of 
these cells can be used for all of the above forms of 
inhibitory nucleic acid therapy. This invention 
encompasses all of the forms of inhibitory nucleic 
acid therapy as described above and as described in 
Heiene and Toulme. 

This invention relates to the targeting cf inhibitory 
nucleic acids to sequences the human herpesvirus of 
the invention for use in treating KS . An example of 
an antiherpes virus inhibitory nucleic acid is ISIS 
2S22 (ISIS Pharmaceuticals) which has activity against 
CKV [see, Biozechnology News 14 (14). p. 5] . 

A problem associated with inhibitory nucleic acid 
the'raoy is the effective delivery cf the inhibitory 
nucleic acid to the target ceil in vivo and the 
subseauent internalization of the inhibitory nucleic 
acid by that cell. This can be accomplished by 
linking the inhibitory nucleic acid to a targeting 
moiety to form a conjugate that binds to a specific 
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receptor on the surface of the target infected cell, 
and which is internalized after binding. 

iii) Administration 

The subjects to be treated or whose tissue may be used 
herein may be a mammal, or more specifically a human, 
horse, pig, rabbit, dog, monkey, or rodent. In tne 
oref erred embodiment the subject is a human. 

The compositions are administered in a manner 
compatible with the dosage formulation, and in a 
therapeutically effective amount. Precise amounts of 
active ingredient required to be administered depend 
15 on the ludgment of the practitioner and are peculiar 

to each subject. 

Suitable regimes for initial administration and 
booster shots are also variable, but are typified by 
an initial administration followed by repeated doses 
£: cne or more hour intervals by a subsequent 
infection or other administration. 

As used herein administration means a method of 
administering to a subject. Such methods are well 
known to those skilled in the art and include, but are 
not limited to, administration topically, 
parenterally, orally, intravenously, intramuscularly, 
subcutaneously or by aerosol. Administration of the 
aaent may be effected continuously or intermittently 
such that the therapeutic agent in the patient is 
effective to treat a subject with Kaposi's sarcoma or 
a subject infected with a DNA virus associated with 
Kaposi's sarcoma. 

The antiviral compositions for treating herpesvirus - 
induced KS are preferably administered to human 
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patients via oral, intravenous or parenteral 
administrations and other systemic forms. Those of 
skill in the art will understand appropriate 
administration protocol for the individual 
5 compositions to be employed by the physician. 

The pharmaceutical formulations or compositions of 
this invention may be in the dosage form cf solid, 
semi-solid, or liquid such as, e.g., suspensions, 
i: . aerosols or the like. Preferably the compositions are 

administered in unit dosage forms suitable for single 
administration of precise dosage amounts. The 
compositions may also include, depending on the 
formulation desired, pharmaceutical^- acceptable , non- 
toxic carriers or diluents, which are defined as 
vehicles commonly used to formulate pharmaceutical 
compositions for animal or human administration. The 
diluent is selected so as not to affect the biological 
activity of the combination. Examples of such 
diluents are distilled water, physiological saline, 
Ringer's solution, dextrose solution, and Hank's 
solution. In addition, the pharmaceutical composition 
cr formulation may also include ether carriers, 
adjuvants; or nontoxic, nontherapeut ic , nonimmunogenic 
stabilizers and the like. Effective amounts cf such 
diluent or carrier are those amounts which are 
effective to obtain a pharmaceutical^ acceptable 
formulation in terms of solubility of components, or 
biological activity, etc. 
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Immune 



logical Approac' ^-^ tic Therapy. 



Having identified a primary causal agent of KS in 
humans as a novel human herpesvirus, there are 
^nmunosuooressive therapies that can modulate the 
immunologic dysfunction that arises from the presence 
of viral infected tissue. In particular, agents that 
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block the immunological attack of the viral infectea 
cells will ameliorate the symptoms of KS and/or reduce 
the disease progress. Such therapies include 

antibodies that specifically block the targeting cf 
viral infected ceils. Such agents include antibodies 
which bind to cytokines that upreguiate the immune 
system to target viral infected ceils. 

The antibody may be administered to a patient either 
singly or in a cocktail containing two or more 
antibodies, other therapeutic agents, compositions, or 
the like, including, but not limited to, immuno- 
suppressive agents, potentiators and side-efrect re- 
lieving agents. Of particular interest are immuno- 
suppressive agents useful in suppressing allergic re- 
actions of a host. Immunosuppressive agents of inter- 
est include prednisone, prednisolone, DECADRON (Merck, 
Sharp & Dohme, West Point, PA), cyclophosphamide, 
cyclosporine , 6 -mercaptopurine , methotrexate , 

azathicprme and i.v. gamma globulin or their 
combination. Potentiators of interest include 

monensin, ammonium chloride and chloroquine. Ail of 
these agents are administered in generally accepted 
efficacious dose ranges such as those disclosed in the 
25 Physician Desk Reference, 41st Ed. (1967), Publisher 

Edward R. 3arnhart , New Jersey. 

immune globulin from persons previously infected with 
human herpesviruses or related viruses can be obtained 
using standard techniques. Appropriate titers of 
antibodies are known for this therapy and are readily 
aoplied to the treatment of KS . Immune globulin can 
be* administered via parenteral injection cr by 
intrathecal shunt. In brief, immune globulin 

preparations may be obtained from individual donors 
who" are screened for antibodies to the KS-associated 
human herpesvirus, and plasmas from high-titerec 
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donors are pooled. Alternatively, plasmas from donors 
are pooled and then tested for antibodies to the human 
herpesvirus of the invention; high-titered pools are 
then selected for use in KS patients. 

5 

Antibodies may be formulated into an injectable 
preparation . Parenteral formulations are known and 
are. suitable for use in the invention, preferably for 
i.m. or i.v. administration. The formulations 
10 containing therapeutically effective amounts of 

antibodies or immunotoxins are either sterile liquid 
solutions, liquid suspensions or lyophiiized versions 
and optionally contain stabilizers or excipients. 
Lyophiiized compositions are reconstituted with 
suitable diluents, e.g., water for injection, saline, 
0.3% glycine and the like, at a level of about from 
.01 mg/kg of host body weight to 10 mg/kg where 
appropriate. Typically, the pharmaceutical 

compositions containing the antibodies or immunotoxins 
will be administered in a therapeutically effective 
dose in a range of from about .01 mg/kg to about 5 
mg/kg of the treated mammal. A preferred 

therapeutically effective dose of the pharmaceutical 
composition containing antibody or immunotoxir. will be 
in a range of from about 0.01 mg/kg to about C.5 mg/kg 
body weight of the treated mammal administered over 
several days to two weeks by daily intravenous 
infusion, each given over a one hour period, in a 
seauential patient dose-escalation regimen. 
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Antibody may be administered systemicaiiy by injection 
i.m., subcutaneously or intraperitoneal or directly 
into KS lesions. The dose will be dependent upon the 
properties of the antibody or immunotoxm employed, 
e.g., its activity and biological half-life, the 
concentration of antibody in the formulation, the site 
and rate of dosage, the clinical tolerance of the 
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oatient involved, the disease afflicting the patient 
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and the like as is well within the skill or 
physician . 

The antibody of the present invention may be 
administered in solution. The pK of the solution 
should be in the range of pH 5 to 9 . 5 , preferably pH 
6.5 to 7.5. The antibody or derivatives thereof 
should be in a solution having a suitable 
pharmaceutical acceptable buffer such as phosphate, 
tris (hydroxymethyl) aminome thane -HCl or citrate and 
the like. Buffer concentrations should be in the 
ranee of 1 to 100 mM. The solution of antibody may 
also contain a salt, such as sodium chloride or 
potassium chloride in a concentration of 50 to 150 mM. 
An effective amount of a stabilizing agent such as an 
albumin, a globulin, a gelatin, a protamine or a salt 
of protamine may also be included and may be added to 
a solution containing antibody or immunotoxin or to 
the composition from which the solution is prepared. 

Systemic administration of antibody is made daily, 
generally by intramuscular injection, although 
intravascular infusion is acceptable. Administration 
may also be intranasal or by other nonparenteral 
-outes. Antibody or immunotoxin may also be 

administered via microspheres, liposomes or other 
microparticulate delivery systems placed m certain 
tissues including blood. 

Tn therapeutic applications , the dosages cf compounds 
us~a in accordance with the invention vary depending 
on the class of compound and the condition being 
seated. The age, weight, and clinical condition of 
the recipient patient; and the experience and judgment 
of the clinician or practitioner administering the 

~ ~- r-y~<= - —-z ~he selected 
therapy are among the .actors a~~e. 
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dosage. For example, the dosage of an immunoglobulin 
can range from about 0.1 milligram per kilogram of 
body weight per day to about 10 mg/kg per day for 
polyclonal antibodies and about 5% to about 20% of 
5 that amount for monoclonal antibodies. In such a 

case, the immunoglobulin can be administered once 
daily as an intravenous infusion. Preferably, the 
dosage is repeated daily until either a therapeutic 
result is achieved or until side effects warrant 
discontinuation of therapy. Generally, the dose 
should be sufficient to treat or ameliorate symptoms 
or signs of KS without producing unacceptable toxicity 
to the patient . 
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An effective amount of the compound is that which 
provides either subjective relief of a symptom(s) or 
an objectively identifiable improvement as noted by 
the clinician or other qualified observer. The dosing 
ranae varies with the compound used, the route of 
administration and the potency of the particular 
compound . 

VI . Vaccines and Prophvl^ v-i s for KS 

This invention provides a method of vaccinating a 
subject against Kaposi's sarcoma, comprising 
administering to the subject an effective amount of 
-he peptide or polypeptide encoded by the isolated DNA 
molecule, and a suitable acceptable carrxer, thereby 
vaccinating the subject. In one embodiment naked DNA 
is administering to the subject in an effective amount 
to vaccinate a subject against Kaposi's sarcoma. 

tu<s invention orovides a method of immunizing a 
subject against a disease caused by the DNA 
herpesvirus associated with Kaposi's sarcoma wnich 
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comprises ' 



administering to the subject an elective 



immunizing dose of the isolated herpesvirus vaccine. 



A. Vaccines 



The invention also provides substances 
use as vaccines 
fcr administering them. The vaccines are directed 
against the human herpesvirus of the mventic 
most preferably 
associated human herpesvirus. 



for the ore vent ion of KS anc metnca^ 



iblv comprise antigen obtained from tne KS- 



m 



Vaccines can be made recombinant ly . Typically, a 
vaccine will include from about 1 to about 50 
micrograms of antigen or antigenic protein or peptide. 
More preferably, the amount of protein is from about 
15 to about 45 micrograms. Typically, the vaccine is 
formulated so that a dose includes about 0.5 
Liiiters. The vaccine may be administered by any 
route known in the art. Preferably, the route is 
parenteral. More preferably, it is subcutaneous or 
intramuscular. 

There are a number of strategies for amplifying an 
25 antigen's effectiveness, particularly as related to 

the art of vaccines. For example, cyclizacion or 
circularization of a peptide can increase the 
peptide's antigenic and immunogenic potency. See U.S. 
Pat. No. 5,001,049 which is incorporated by reference 
3C herein. More conventionally, an antigen can be 

conjugated to a suitable carrier, usually a protein 
molecule. This procedure has several facets. It can 
allow multiple copies of an antigen, such as a 
oeotide, to be conjugated to a single larger carrier 
molecule. Additionally, the carrier may possess 
properties which facilitate transport , binding, 
absorption or transfer of the antigen. 
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For parenteral administration, such as subcutaneous 
injection, examples of suitable carriers are the 
tetanus toxoid, the diphtheria toxoid, serum album.r. 
and lamprey, or keyhole limpet, hemocyan.n because 
they provide the resultant conjugate with minimum 
aenetic restriction. Conjugates including these 
universal carriers can function as T cell cior.e 
activators in individuals having very different gene 



sets . 



The conjugation between a peptide and a carrier can be 
accomplished using one of the methods known in the 
art. Specifically, the conjugation can use 

bifunctional cross-linkers as binding agents as 
detailed, for example, by Means and Feeney, "A recent 
review of protein modification techniques," 
Bioconjugate Chew. 1:2-12 (1990). 

Vaccines against a number of the Herpesviruses have 
20 been successfully developed. Vaccines against 

Varicella-Zoster Virus using a live attenuated Oka 
strain is effective in preventing herpes zoster in the 
elderly, and in preventing chickenpox in both 
immunocompromised and normal children (Hardy, I., et 

25 al. [30]; Hardy, I. et al . [31]; Levin, K.J. ec al . 

r 54 i ; Gershon, A. A. [26] . Vaccines against Herpes 
simolex Types 1 and 2 are also commercially available 
w <.h some success in protection against primary 
disease, but have been less successful in preventing 

3 0 the establishment of latent infection in sensory 

ganglia (Roizman, B . [78]; Skinner, G.R. e:al. [87]). 

vaccines aaainst the human herpesvirus can be made by 
isolating extracellular viral particles from infected 
3= ce^l cultures, inactivating the virus with 

formaldehyde followed by ultracentrif ugation to 
concentrate the viral particles and remove the 
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formaldehyde, and immunizing individuals witr. 2 cr ; 
doses containing 1 x 10' virus particles (Skinner, C-.R. 
ec al. [B6]). Alternatively, envelope glycoproteins 
can be expressed in E. coli or transfected into stable 
mammalian cell lines, the proteins can be purified jind 
used for vaccination (Lasky, L.A. [53]). -MKC 
binding peptides from cells infected with the human 
herpesvirus can be identified fcr vaccine candidates 
oer the methodology of [61], supra. 

The antigen may be combined or mixed with various 
solutions and other compounds as is known in the art. 
For example, it may be administered in water, saline 
or buffered vehicles with or without various adjuvants 
or immunodi luting agents. Examples of such adjuvants 
or agents include aluminum hydroxide, aluminum 
phosphate, aluminum potassium sulfate (alum), 
beryllium sulfate, silica, kaolin, carbon, water-in- 
oil emulsions, oil-in-water emulsions, muramyl 
0 dipeptide, bacterial endotoxin, lipid X, 

Corynebacterium parvum (Propionibacterium acnes), 
Sordetella pertussis, polyribonucleotides, sodium 
alginate, lanolin, lysoiecithin, vitamin A, saponin, 
liposomes, levamisole, DEAS-dextran. blocked 
= co'oolymers or other synthetic adjuvants. Such 

adjuvants are available commercially from various 
sources, for example, Merck Adjuvant 65 (Merck and 

Pahwav N J ) or Freund's Incomplete 
Comoany, Inc., Ranway, im.j.; <->- 

A^i'uvant and Complete Adjuvant (Difco Laboratories, 
3 r, o-^oit. Michiaan). Other suitable adjuvants are 

Amohigen (oil-in-water). Aihydrogel (aluminum 
hydroxide), or a mixture of Amohigen ana Aihycroge,. 
Only aluminum is approved for human use. 

The orooortion of antigen and adjuvant can be varied 
ove- a broad range so long as both are present in 
.«.«ive amounts. For example, aluminum hydroxiae 
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can be present in an amount of about 0.5% c: zne 
vaccine mixture (A1 2 0 3 basis). On a per-dose basis, 
rhe amount of the antigen can range from about 0.2 ug 
:o about 100 u9 protein per patient. A preferable 

5 ranae is from about 1 ug to about 50 uS per dose. A 

more preferred range is about 15 ug to about 45 ug - 
A suitable dose size is about 0.5 ml. Accordingly, a 
dose for intramuscular injection, for example, would 
comprise 0.5 ml containing 45 ug of antigen in 

10 admixture with 0.5% aluminum hydroxide. After 

formulation, the vaccine may be incorporated into a 
sterile container which is then sealed and stored at 
a low temperature, for example 4°C, cr it may be 
f reeze-dried. Lyophilizat ion permits long-term 

15 storage in a stabilized form. 

The vaccines may be administered by any conventional 
method for the administration of vaccines including 
oral and parenteral (e.g., subcutaneous or 
20 intramuscular) injection. intramuscular 

administration is preferred. The treatment may 
consist of a single dose of vaccine or a plurality of 
doses over a period of time. It is preferred that the 
dose be given to a human patient within the first 6 
months of life. The antigen of the invention can be 
combined with appropriate doses of compounds including 
influenza antigens, such as influenza type A antigens. 
Also, the antigen could be a component of a 
recombinant vaccine which could be adaptable for oral 
30 administration. 
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Vaccines of the invention may be combined with other 
vaccines for other diseases to produce multivalent 
vaccines. A pharmaceut ically effective amount of the 
35 an-ia-n can be employed with a pharmaceut ically 

acceptable carrier such as a protein or diluent useful 
for the vaccination of mammals, particularly humans. 
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Other vaccines may be prepared according to me-.hods 
well-known those skilled in the art. 

These cf skill will readily recognize that it is or.ly 
necessary to expose a mammal to appropriate epitopes 
ir. order to elicit effective immuncprotecticr. . Trie 
epitopes are cypically segments of amino acids which 
are a small portion cf the whole protein. -sing 
recombinant genetics, it is routine to alter a natural 
protein's primary structure to create derivatives 
embracing epitopes that are identical to or 
substantially the same as (immunologically equivalent 
co) the naturally occurring epitopes. Such 
derivatives may include peptide fragments, amino acid 
substitutions, amino acid deletions and amino acid 
additions of the amino acid sequence for the viral 
proteins from the human herpesvirus. For example, it 
is known in the protein art that certain amino acid 
residues can be substituted with amino acids of 
s^irrlar size and polarity without an undue effect upon 
the biological activity of the protein. The human 
herpesvirus proteins have significant tertiary 
s -^. c - ure and the epitopes are usually conformational. 
Thus, modifications should generally preserve 
conformation to produce a protective immune response. 

B. Antibody Prophylaxis 



Therapeutic , intravenous, polyclonal or monoclonal 
antibodies can been used as a mode of passive 

k.rrmsv^ -"-a" 1 diseases including 
immunotherapy of ner?es\ 

t>— < natal varicella and CMV. Immune cioouiir. trom 
o^sons previously infected with the human herpesvirus 
and bearing a suitably high titer cf antiooaies 
against the virus can be given in combination witn 
antiviral agents (e.g. ganciclovir), or in combination 
with other modes of immunotherapy that are currently 
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being evaluated for the treatment of KS, which are 
taraeted to modulating the immune response (i.e. 
treatment with copolymer- 1, antiidiotype monoclonal 
antibodies. T cell "vaccination"). Antibodies to 
5 human herpesvirus can be administered to the patient 
as described herein. Antibodies specific for an 
epitope expressed on cells infected with the human 
heroesvirus are preferred and can be obtained as 



described above . 



A polypeptide, analog or active fragment can be 
formulated into the therapeutic composition as 
neutralized pharmaceutically acceptable salt forms. 
Pharmaceutical^ acceptable salts include the acid 
.5 addition salts (formed with the free amino groups of 

the polypeptide or antibody molecule) and which are 
formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic 
acids as acetic, oxalic, tartaric, mandelic, and the 
2C like. Salts formed from the free carboxyl groups can 

also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium, or 
ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine , 2 -ethylar.ino ethanol, 
tz. histidine, procaine, and the like. 

C. Monitoring therapeutic efficacy 

This invention provides a method for monitoring the 
therapeutic efficacy of treatment for Kaposi^ s 
sarcoma, which comprises determining in a first sample 
from a subject with Kaposi's sarcoma the presence of 
-he isolated DNA molecule, administering to the 
subject a theraoeutic amount of an agent such that the 
ag-rt is contacted to the cell in a sample, 
determining after a suitable period of time the amount 
~* isolated DNA molecule in the second sample from 
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the treated subject, and comparing the amount cf 
isolated DNA molecule determined in the first sample 
with the amount determined in the seoond sample , a 
difference indicating the effectiveness of the agent, 
5 thereby monitoring the therapeutic efficacy of 

treatment for Kaposi's sarcoma. As defines, .nereir. 
'•amount" is viral load or copy number. Methods cf 
determining viral load or copy number are known to 
those skilled in the art. 
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V tt. Qr~»r.ino Assays ?or pha ^-ma nautical Aaer.ts. 
T -,rprpqf in Alleviating the ? vmntoms of KS . 

Since an agent involved in the causation or 
progression of KS has been identified and described 
"here. assays directed to identifying potential 
Dharmaceutical agents that inhibit the biological 
activity of the agent are possible. KS drug screening 
assays which determine whether or not a drug has 
activity against the virus described herein are 
contemplated in this invention. Such assays comprise 
-ncubatina a compound to be evaluated for use in KS 
treatment" with cells which express the KS associated 
human herpesvirus proteins or peptides and determining 
therefrom the effect of the compound on the activity 
such aaent. In vitro assays in which the virus is 
maintained in suitable cell culture are preferred, 
though in vivo animal models would also be effective. 

Compounds with activity against the agent of interest 
or oeotides from such agent can be screened m in 
as well as in vivo assay systems. In vitro 
assays include infecting peripheral blood leukocytes 
o- susceptible T cell lines such as KT-4 with the 
3, aaent of interest in the presence of varying 

concentrations of compounds targeted against viral 
replication, including nucleoside analogs, chain 



WO 96/06159 



PCT/US95/10194 



86 

terminators, antisense oligonucleotides and random 
polypeptides (Asada, H. et al. [7]; Kikuta et al . [48] 
both incorporated by reference herein) . Infected 
cultures and their supernatants can be assayed for the 
total amount of virus including the presence cf the 
viral genome by quantitative PCR, by dot blot assays, 
cr by using immunologic methods. For example, a 
culture of susceptible ceils could be infected with 
the human herpesvirus in the presence cf various 
concentrations of drug, fixed on slides after a period 
of days, and examined for viral antigen by indirect 
immunofluorescence with monoclonal antibodies to viral 
peptides ([48], supra. Alternatively, chemically 
adhered MT-4 cell monolayers can be used for an 
infectious agent assay using indirect 
immunof luorescent antibody staining to search for 
focus reduction (Higashi, K. etal. [36], incorporated 
by reference herein) . 

2 0 As an alternative to whole cell in vitro assays, 

purified enzymes isolated from the human herpesvirus 
can be used as targets for rational drug design to 
determine the effect of the potential drug or. enzyme 
activity, such as thymidine phosphotransferase or DNA 

25 polymerase. The genes for these two enzymes are 

crovided herein. A measure of enzyme activity 
indicates effect on the agent itself. 

Drug screens using herpes viral products are known and 
30 have been previously described in E? 0514530 (herpes 

proteases) and WO 94/04920 (U L 13 gene product) . 

This invention provides an assay for screening an— -KS 
chemotherapeutics. Infected cells can be incubated in 
3 5 the presence of a chemical agent that is a potential 

chemotherapeutic against KS (e.g. acyclo-guanosine) . 
The level of virus in the cells is then determined 
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after several days by I FA for antigens or Southern 
blottina for viral genome or Northern blotting ror 
KRNA and compared to control cells. This assay car. 
C kiy screen large numbers of chemical cor.pour.2s 
5 that may be useful against KS . 
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:his invention provides an assay system tnat 
is employed to identify drugs cr other molecules 
caoabie cf binding to the DNA molecule cr proteins, 
either in the cytoplasm or in the nucleus, thereby 
inhibiting or potentiating transcriptional activity. 
Such assay would be useful in the development cf drugs 
that would be specific against particular cellular 
activity, or that would potentiate such activity, in 
time or in level of activity. 

This invention is further illustrated in the 
Experimental Details section which follows. This 
section is set forth to aid in an understanding of the 
invention but is not intended to, and should not be 
construed to, limit in any way the invention as set 
forth m the claims which follow thereafter. 

TTYPBRIMENT^L DETAI L SECTION I; 

.yppri^ent 1: Representat ioixal difference analysis 

(RDA) to identify and characterize 
unique DNA sequences in KS tissue 

3 0 To search for foreign DNA sequences belonging to an 

infectious agent in A-i-s-i^s. - 

difference analysis (RDA) was employed :s identify and 
characterize unique DNA sequences in KS tissue that 
a— either absent or present in low copy number -n 
^--diseased tissue obtained from the same patient 
[55] This method can detect adenovirus genome aaaec 
s :, B i. cooy to human DNA but has not been used to 
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identify previously uncultured infectious agents. RDA 
is oerformec by making simplified "representations" of 
genomes from diseased and normal tissues from the same 
individual through PCR amplification cf short 
5 restriction fragments. The DNA representation from 

zhe diseased tissue is then ligated to a priming 
sequence and hybridized to an excess cf unligated, 
normal tissue DNA representation. Only unique 

sequences found in the diseased tissue have priming 
sequences ou both DNA strands and are preferentially 
amplified during subsequent rounds of PCR 
amplification. This process can be repeated using 
different ligated priming sequences to enrich the 
sample for unique DNA sequences that are only found in 
the tissue of interest. 
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DNA (10 fig) extracted from both the KS lesion and 
unaffected tissue were separately digested to 
completion with Bam HI (20 units/fig) at 37° C for 2 
hours and 2 M9 of digestion fragments were ligated to 
NSanl2 and N3am24 priming sequences [primer sequences 
described in 58] . Thirty cycles of PCR amplification 
were performed to amplify "representations" of both 
aer.omes. After construction of the genomic 

representations, KS tester amplicons between 150 and 
1500 bp were isolated from an agarose gel and NBam 
oriming sequences were removed by digestion with Bam 
HI. To search for unique DNA sequences not found in 
non-KS driver DNA, a second set of .priming sequences 
(JBaml2 and JBam24) was ligated onto only the KS 
tester DNA amplicons (Figure 1, lane 1). 0.2 M 9 of 
1<n:e d KS lesion amplicons were hybridized to 20 n<3 
o/ unligated, normal tissue representational 
amolicons. An aliquot of the hybridization product 
was then subjected to 10 cycles of PCR amplification 
us^ng JBam24, followed by mung bean nuclease 
diaestion. An aliquot of the mung bean-treatea 
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difference product was then subjected to IS more 
cycles of PCR with the JBam24 primer (Figure 1, lane 
2). Amplification products were redigested with Bam 
-I and 200 na of the digested product was Iigateo tc 
5 RBarr.12 and R3am24 primer sets for a second round or 

hvhridizaticn and PCR ampiif icat ion (Figure 1, lane 
This enrichment procedure was repeatec a tnirc 
time using the J3am primer set (Figure 1, lane 4). 
Both the original driver and the tester DNA samples 
(Table 2, Patient A) were subsequently found tc 
contain the AIDS-KS specific sequences KS3303am and 
>;SS313am {previously identified as KS627Bam) 
indicating that RDA can be successfully employed when 
the target sequences are present in unequal copy 
number in both tissues. 

The initial round of DNA ampiif ication- hybridization 
from KS and normal tissue resulted in a diffuse 
banding pattern (Figure 1, lane 2), but four bands at 
approximately 3B0, 450, 540 and 6BC bp were 
identifiable after the second amplification- 
hybridization (Figure 1, lane 3). These bands became 
discrete after a third round of ampiif ication- 
hvbridization (Figure 1, lane 4). Control RDA, 
performed by hybridizing DNA extracted from AIDS - KS 
tissue against itself, produced a single band at 
approximately 540 bp (Figure I, lane 5) . The four KS- 
associated bands (designated KS3303am, KS390Bam, 
KS480Bam, KS€273am after digestion of the two flanking 
2 6 bp ligated priming sequences with 3air< HI) were 
purified and cloned by insertion into the pCRII 
~ vec - cr . pcr products were cloned m the pCRII vector 
using the TA cloning system (Invitrogen Corporation, 
San Diego, CA) - 
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^ P ri mg nt 2_l Determination of the specificity of 

AIDS-KS unique sequences. 

To determine the specificity of these sequences for 
5 AIDS-KS, random-primed S2 P-labeled inserts were 

hybridized to Southern blots of DNA extracted from 
cryocreserved tissues obtained from patients with and 
without AIDS. All AIDS-KS specimens were examined 
microscopically for morphologic confirmation of KS and 
10 immunohistochemically for Factor VIII, Ulex europaeus 

and CD34 antigen expression. One of the AIDS-KS 
specimens was apparently mislabeled since KS tissue 
was not detected on microscopic examination but was 
included in the KS specimen group for purposes of 
15 statistical analysis. Control tissues used for 

comparison to the KS lesions included 56 lymphomas 
from patients with and without AIDS, 19 hyperplastic 
lymph nodes from patients with and without AIDS, 5 
vascular tumors from nonAIDS patients and 13 tissues 
2C infected with opportunistic infections that commonly 

occur in AIDS patients. Control DNA was also 
extracted from a consecutive series of 4 9 surgical 
biopsy specimens from patients without AIDS. 
Additional clinical and demographic information on the 
25 specimens was not collected to preserve patient 

cor.f ident iality . 

The tissues, listed in Table 1, were collected from 
diagnostic biopsies and autopsies between 1963 and 

30 1993 and stored at -70°C. Each tissue sample was from 

a different patient, except as noted in Table 1. Most 
of the 27 KS specimens were from lymph nodes dissected 
-nde- surgical conditions which diminishes possible 
contamination with normal skin flora. All specimens 

^5 were diaested with 3am HI prior to hybridization. 
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KS390Bam and KS480Bam hybridized nonspecif ically to 
both KS and non-KS tissues and were not further 
characterized. 20 of 27 (74%) AIDS-KS DNAs hybridizes 
with variable intensity to both KS330Bam and KS627Ban, 
S and one additional KS specimen hybridized only to 

KSS27Sam by Southern blotting (Figure 2 and Table I) • 
lr. contrast to AIDS-KS lesions, only 6 of 3 ? (15%; 
non-KS tissues from patients with AIDS hybridized tc 
the KS330Bam and KS627Bam inserts (Table 1) . 

Specific hybridization did not occur with lymphoma or 
i'ymoh node DNA from 36 persons without AIDS or with 
^'-r 0 l DNA from 4 9 tissue biopsy specimens obtained 
' rrom a consecutive series of patients. DNA extracted 
15 ;rom several vascular tumors, including a 

hemangiopericytoma, two angiosarcomas and a 
lymphangioma, were also negative by Southern blot 
hvbridization. DNA extracted from tissues with 
opportunistic infections common to AIDS patients, 
20 including 1 acid- fast bacillus (undetermined species) , 

1 cytomegalovirus, 1 cat-scratch bacillus, 2 
crvptocoocus and 1 toxoplasmosis infected tissues, 
were negative by Southern blot hybridization to 
KS33C5am and K3627Bam (Table 1) . 
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Tab i e i. Southern blot hybridization for KS3303arr^anc 
K£o273am and PCR amplification ror Ki^C :? . 
in human tissues from individual patients. 



"issue 



n KS3 3 0Batn Southerr. KS627Bam Southern KS33C : „ 

hvfcridiza^ior. r. t\\ hybridisation r. ( h ' positive 

;-n 5 .« 27* 20 (74) 21 (76) 25 (S3) 



AIDS 

lymphomas 

AIDS 12 
lymph nodes 

Non- AIDS 2 9 

Lymphomas 

Non-AIDS 7 
lymph nodes 

Vascular 4§ 
rumors 



27: 3 (11) 2 (11) 2 

12 3 (25) 3 (25) 3 (25) 

0 (0) 0 (0) o (0) 

0 (0) 0 (0) C (0) 

0 (0) 0 (0) 0 (0) 

0 (0) 0 (0) 



Opportunistic 1311 0 (0) 
infections 

Consecutive 49t" 0 (0) 0 (0) 0 (0) 

surgical biopsies 
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Legend ^ Table 1: 

* Includes one AIDS-KS specimen unamplif iable :or p = : 
exon 6 and one tissue which on microscopic exammatic: 
did nor have any detectable KS tissue present- 3ct: 
of these samples were negative by Southern bio: 
hvbridization to KS3303am and KS627Bam and by ~C: 
amplification for the KS33C- 3 , amplicon. 



10 : Includes 7 small non-cleaved ceil lymphomas, 

diffuse large cell and immunobiastic lymphomas. Three 
of the lymphomas with immunobiastic morphology were 
positive for KS330Bam and KS 6 2 73am . 

15 t includes 13 anaplastic large cell lymphomas, 4 

diffuse large cell lymphomas, 4 small lymphocytic 
lymphomas /chronic lymphocytic leukemias , 3 hairy cell 
leukemias, 2 monocytoid B-cell lymphomas, 1 follicular 
small cleaved cell lymphoma, 1 Burkitt's lymphoma, 1 

2 0 plasmacytoma . 

§ Includes 2 angiosarcomas, I hemangiopericytoma and 
1 lymphangioma. 

•> = z Includes 2 cryptococcus , 1 toxoplasmosis, 1 cat- 

scratch bacillus, 1 cytomegalovirus, I Epsiem-Barr 
virus, and 7 acid-fast bacillus infected tissues. In 
addition, pure cultures of Mycobacterium avium- complex 
were negative by Southern hybridization and PCR, and 

3: pure cultures of Mycoplasma penetrans were negative by 



* Tissues included skin, appendix, kidney, prostate, 
hernia sac, lung, fibrous tissue, gallbladder, coior., 
-creskin, thyroid, small bowel, adenoid, vein, 
axillary tissue, lipoma, heart, mouth, hemorrhoid, 
cseudoaneurysm and fistula track. Tissues were 
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collected from a consecutive series of biopsies 
patients without AIDS but with unknown HIV serostatus. 



5 **Apparent nonspecific hybridization at approximately 

20 Kb occurred in 4 consecutive surgical biopsy DNA 
samples: one colon and one hernia sac DNA sample 
hybridized to KS3303am alone, another hernia sac DNA 
sample hybridized to KS6273am alone and one appendix 
10 DNA* sample hybridized to both KS3303am and KS6273am. 

These samples did not hybridize in the 3 30-63 0 bp 
range expected for these sequences and were PCR 
negative for KS330 23 «. 
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In addition, DNA from Epstein-Barr virus- infected 
peripheral blood lympho eyres and pure cultures or 
Mycobacterium avium- complex were also negative by 
Sou-hern hybridization. Overall, 20 of 27 (74%: A1DS- 
KS specimens hybridized to KS3303am and 21 of 27 (7S%^ 
a:DS~-KS specimens hybridized to KS£273am, compared to 
only € of 142 (4%) non-KS human DNA control specimens 
7x=-85.02. p< 10^ and x-92.4, p< 10" 7 respectively). 

10 Tne sequence copy number in the AIDS-KS tissues was 

estimated by simultaneous hybridization with KS330Bam 
and a 44 0 bp probe for the constant region of the T 
cell receptor £ gene [76] . Samples in lanes 5 and 6 
of Figures 2A-2B showed similar intensities for the 

15 zwo p " ro bes indicating an average copy number of 

approximately two KS330Bam sequences per cell, while 
remaining tissues had weaker hybridization signals for 
the K£330Bam probe. 



Fv pp.riment 3: Characterization of KS3 3 0Bam 

KS627Bam 



and 



To further characterize KS330Bam and KSS27Bam, six 
clones for each insert were sequenced. The Sequenase 

25 version 2.0 (United States Biochemical , Cleveland, OH) 

system was used and sequencing was performed according 
to manufacturer's instructions. Nucleotides sequences 
were confirmed with an Applied Biosystems 373A 
Sequencer in the DNA Sequencing Facilities at Columbia 

3 0 University. 

•-;=•. 3 03am is a 330 bp sequence with 51% G:C content 
(Piaure 3B) and KS627Bam is a 627 bp sequence with a 
63%' G:C content (Figure 3C) . KS33 03am has 54% 
nucleotide identity to the BDLF1 open reading frame 
(ORF) of E P stein-3arr virus (EBV) - Further analysis 
revealed that both KS3303am and KS = 273am coae for 
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amino acid sequences with homology to polypeptides of 
viral origin. SwissProt and PIR protein databases 
were searched for homologous ORF using BLASTX [3] . 

5 K£33 03am is 51% identical by amino acid homology to a 

portion of the ORF 2 6 open reading frame encoding the 
caosid protein VP23 (NCHI g.i. 60346, bp 46024 - 
46935) of herpesvirus saimiri [2] , a gammaherpesvirus 
which causes fulminant lymphoma in New world monkeys. 
10 This fragment also has a 3 9% identical amino acid 

sequence to the theoretical protein encoded by the 
homologous open reading frame BDLF1 in EBV (NC3I g.i. 
59140, bp 132403 -133307) [9] . The amino acid 
sequence encoded by KS62 7Bam is homologous with weaker 
identity (31%) to the tegument protein, gpl40 (ORF 29, 
NCBI g.i. 60396, bpl08782 -112681) of herpesvirus 
saimiri . 

Sequence data from KS330Bam was used to construct PCR 
primers to amplify a 234bp fragment designated KS330 23 « 
(Fiaure 3B) . The conditions for PCR analyses were as 
fellows: 94°C for 2 min (1 cycle); 94°C for 1 min, 
55°C for 1 min, 72°C for 1 min (35 cycles) ; 72°C 
extension for 5 min (1 cycle) . Each PCR reaction used 
0.1 ^g of genomic DNA, 50 pmoles of each primer, 1 
unit of Taq polymerase, 100 uK of each 

deoxynucleotide triphosphate, 50 mM KCl, lOmM Tns-HCl 
(pH 9-0), and 0.1% Triton-X-100 in a final volume of 
2*5 Ml- Amplifications were carried, out in a Perkin- 
Eimer 480 Thermocycler with 1-s ramp times between 
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steos . 



Although Southern blot hybridization detected the 
KS330Bam sequence in only 20 of 27 KS tissues, 25 or 
the 27 tissues were positive by PCR amplification for 
KS330 23 < (Figures 4A-43) demonstrating that KS3303am is 
oresent in some KS lesions at levels below the 
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-r-eshold for detection by Southern =-='■ 
hybridization. All KS33 0 H< PCR products hybridized to 
a'"P end- labelled 25 bp internal oligomer, confi- ...mg 
-h» specificity of the PCR (Figure 4B) . Of the two 
^c-KS specimens negative for KS3 3 0 : 34 . both specimens 
sheared "to be negative for technical reasons: one 
had no microscopically detectable KS tissue ir. tne 
frozen sample {figures 4A-4E, lane 3), ano tne ctr.er 
(Figures 4A-4B, lane 15) was negative in the control 
o C R~ amplification for the p33 gene indicating either 
DNA degradation or the presence cf PCR inhibitors in 
the sample. PCR amplification of the P 53 tumor 
suppressor gene was used as a control for DNA quality. 
Sequences of p53 primers from P6-5. 5'- 
ACAGGGCTGGTTGCCCAGGGT-3' (SEQ ID No: 44! ; and P6-3. 5'- 
AGTTGCAAACCAGACCTCAG-3' (SEQ ID NO: 45) [25]. 

Except for the 6 control samples from AIDS patients 
-hat* were also positive by Southern blot 
hybridization, none of the other 135 control specimens 
were positive by PCR for KS3 3 0 234 . ^ °' these 
specimens were amplifiable for the F 53 gene, 
indicating that inadequate PCR amplification was not 
-•-e reason for lack of detection cf KS3 3C J3< in the 
control tissues. Samples containing DNA from two 
-Indicate KS agents, E3V and Mycoplasma penetrans 
( A t.— Accession No. 55252), a pathogen commonly found 
in the genital tract of patients with AIDS-KS [59] 
w .~ als'o neaative for amplification cf KS330 3J4 . In 
addition, several KS specimens were tested using 



commercial PCR primers (Scratagene, ^V^Z^^i 
soecif :c for mycoplasmas a and primers s?ec::ic r=r -ne 
E3NA-2, E3NA-3C and E3ER regions cf E3V and were 
negative [57] . 

Overall, DNA from 25 (53%) of 27 AIDS-KS tissues were 
positive by PCR compared with DNA from 6 (4%) of 14. 
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control tissues, including 6 (15%) of 3 9 non-KS lymph 
nodes and lymphomas from AIDS patients (^ = 38.2, p < 
I0- 6 ) 0 of 3 6 lymph nodes and lymphomas from nonAIDS 
oatients (* 2 =55.2, p < 10") and 0 of 49 consecutive 
bioosy specimens (x^.7, p < 10' 7 ) . Thus, KS33C a „ was 
found in all 25 amplifiable tissues with 
microscopically detectable AIDS-KS, but rarely 
occurred in non-KS tissues, including tissues from 
AIDS patients. 

Of the six control tissues from AIDS patients that 
were positive by both PCR and Southern hybridization, 
two patients had KS elsewhere, two did not develop KS 
and complete clinical histories for the remaining two 
patients were unobtainable. Three of the six positive 
"non-KS tissues were lymph nodes with follicular 
hyperplasia taken from patients with AIDS. Given the 
high prevalence of KS among patients with AIDS, it is 
possible that undetected microscopic foci of KS were 
present in these lymph nodes. The other three 
positive tissue specimens were B ceil immunoblastic 
lymphomas from AIDS patients. It is possible that the 
putative KS agent is also a cofactor for a subset of 
AIDS - associated lymphomas [16, 17, 80]. 

To determine whether KS330Bam and KS6273am are 
portions of a larger genome and to determine the 
proximity of the two sequences to each other, samples 
of KS DNA were digested with Pvu II restriction 
enzymes. Digested genomic DNA from three AIDS-KS 
samples were hvbridized to KS330Bam and K56 273am by 
Southern blotting (Figure 5). These sequences 
hybridized to various sized fragments of the digested 
KS DNA indicating that both sequences are fragments of 

Pi*fpr e nces in the KS330Bam 
"arger genomes. DiLterenceb 

hybridization pattern to Pvu II digests of the three 
AIDS-KS specimens indicate that polymorphisms may 
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ccur in the larger genome. Individual fragments from 
the digests failed tc simultaneously hybridize with 
both KS3303am and KS627Bam / demonstrating that tnese 
Bam HI restriction fragments are not adjacent to 



tw 

one another 



If KS2 3 03arr. and KS£2 73air are heritable polymorphic DNA 
-•arkers for KS , these sequences should be uniformly 
detected at ncn-KS tissue sites in patients witn AIDS- 
KS. Alternatively, if KS3303am and KS6273arr. are 
sequences specific for an exogenous infectious agent, 
it is likely that some tissues are uninfected and lack 
detectable KS3303am and KS6273am sequences. DNA 
extracted from multiple uninvoived tissues from three 
patients with AIDS-KS were hybridized to ^-labelled 
KS3303am and KS627Bam probes as well as analyzed by 
?CR using the KS3 3 0 234 primers (Table 2). While KS 
lesion DNA samples were positive for both bands, 
unaffected tissues were frequently negative for these 
sequences. KS lesions from patients A, 3 and C, and 
uninvoived skin and muscle from patient A were 
positive for KS330Bam and KS627Bam, but muscle and 
brain tissue from patient B and muscle, brain, colon, 
heart and hilar lymph node tissues from patient C were 
negative for these sequences. Uninvoived stomach 
linma adiacent to the KS lesion in patient C was 
positive by PGR, but negative by Southern blotting 
w-ich suagests the presence of the sequences m this 
tissue at levels below the detection threshold for 
Southern blotting. 
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Table 2 : 



Differential detection of KS330Bam, KS627Bwn 
and KS330„ 4 sequences in KS-involved and 
non- involved tissues from three patients 
with AIDS-KS. 











KS2 3 03am 


KSc 2 75am | 


KS33C r?; 


Par lent A 




1 


KS , skin 








nl skin 




* 




nl muscle 


-t- 






Patient B 








KS, skm 




-t- 




nl muscle 


- 






nl brain 








Patient C 

KS , stomach 






4- 


nl stomach 
adjacent to KS 

nl muscle 






4- 


nl brain 
nl colon 








nl heart 








nl hilar lymph 
nodes 
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yv ppTi TnsTit 4= Subcloning and sequencing of KSHV 

KS"3 0Bam and KS627Bam are genomic fragments of a novel 
infectious agent associated with AIDS-KS. A genomic 
^b-a-y from a KS lesion was made and a phage clone 
w ^ h a 20 kb insert containing the KS3 303am sequence 
was identified. The 20 kb clone digested with PvuII 
(wh^h cuts in the middle of the KS3303am sequence) 
o-odu-ed 1.1 kb and 3 kb fragments that hybridized to 
KS330Bam. The 1.1 kb subcloned insert and -900 bp 
-om the 3 kb subcloned insert resulting in 94 04 bp of 
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contiguous sequence was entirely sequenced. This 
sequence contains partial and complete open reading 
frames homologous to regions in gamma herpesviruses. 

5 The KS33 0Bam sequence is an internal portion of an 915 

bp OR? with 55-56% nucleotide identity to the ORF26 
arid BDLF1 genes of KSVSA and E3V respectively. The 
E3V and HSVSA translated amine acid sequences rcr 
these OR? s demonstrate extensive homology with the 
10 amino acid sequence encoded by the KS -associated 91S 

bo ORF (Figure 6). In HSVSA, the V?23 protein is a 
late structural protein involved in capsid 
construction. Reverse transcriptase (RT) -PCR of mRNA 
from a KS lesion is positive for transcribed KS330Bam 
mRNA and that indicates that this ORF is transcribed 
in KS lesions. Additional evidence for homology 
between the KS agent and herpesviruses comes from a 
comparison of the genomic organization of other 
potential ORFs on the 9404 bp sequence .{Figure 3A) 
, G T he 5' terminus of the sequence is composed 

nucleotides having 66-67% nucleotide identity and 68- 
71% amino acid identity to corresponding regions of 
the major capsid protein <MCP) ORFs for both E3V and 
HSVSA . This putative MCP ORF of the KS agent lies 
25 immediately 5' to the BDLF1/ORF26 homoiog which is a 

conserved orientation among herpesvirus subfamilies 
for these two genes. At the 3' end of this sequence, 
the reading frame has strong ammo acid and nucleotide 
homoioov to HSVSA ORF 27. Thus, KS-associated DNA 
3C sequences at four loci in two separate regions witn 

h~ ^-va e-.T - -r-zi i aori^Tpcc have been 
i^i„^~~ — _ erpes/-ra_ g_»om~ 

ident i f ied . 

;r. addition to fragments obtained fro-, Pvu 11 digest 
35 cf the 21 Kb phage insert described above, fragments 

obtained from a 3amHI/NotI digest were also subcioned 
into p3luescript (Stratagene, La Jolla, CA) . Tne 
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termini of these sub cloned fragments were sequenced 
and were also found to be homologous to nucleic acid 
sequence EBV and HSVSA genes. These homologs have 
been used to develop a preliminary map of subclones 
- fraaments (Figure 9) . Thus, sequencing has revealed 

that the KS agent maintains co-linear homology to 
aamma herpesviruses over the length of the 21 Kb phage 
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Pv periment 5: Determination of the phylogeny of KSHV 



Regions flanking KS3303am were sequenced and 
characterized by directional walking. This was 
performed by the following strategy: 1) KS genomic 
15 libraries were made and screened using the KS330Bam 

fragment as a hybridization probe, 2) DNA inserts from 
phage clones positive for the KS330Bam probe were 
Isolated and digested with suitable restriction 
enzyme (s), 3) the digested fragments were subcloned 
2C into pBluescript (Stratagene, La Jolia, CA> , and 4) 

the subclones were sequenced. Using this strategy, 
-he major capsid protein (MC?) ORF home log was the 
first important gene locus identified- Using 
secuenced unique 3' and 5' end- fragments from positive 
25 phage clones as probes, and following the strategy 

above a KS genomic library are screened by standard 
methods for additional contiguous sequences. 



For sequencing purposes, restriction fragments are 
subcloned into phagemid pBluescript KSt , pBluescript 
KS-, pBS- , or p3S- (Stratagene) or into piasmid pUClS 
or pUC19. Recombinant DNA was purified through CsCl 
density gradients or by anion-exchange chromatography 
CQiagen) - 

Nucleotide sequenced by standard screening methods of 
cloned fragments of KSHV were done by direct 
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sequencing of double- stranded DNA using 
oligonucleotide primers synthesized commercially zc 
"walk" along the fragments by the dideoxy-nuclectide 
chain termination method. Junctions between clones 
5 are confirmed by sequencing overlapping clones. 

Targeted homologous genes in regions flanking K=3 3 05am 
include, but are not limited to: II -1G r.omc-og, 
thymidine kinase (TK) , g65 ; g35, gH, capsid proteins 
1C and MCF. TK is an early protein of the herpesviruses 

functionally linked to DNA replication and a target 
enzyme for anti-herpesviral nucleosides. TK 
phosphorylates acyclic nucleosides such as acyclovir 
"which in turn inhibit viral DNA polymerase chain 
extension. Determining the sequence of this gene will 
aid in the prediction of chemotherapeutic agents 
useful against KSHV. TK is encoded by the EBV BXLF1 
ORF located -9700 bp rightward of BDLF1 and by the 
HSVSA ORF 21 -9200 bp rightward of the ORF 26. A 
subcioned fragment of KS5 was identified with strong 
homology to the EBV and HSVSA TK open reading frames. 

cr£ 5 is a late glycoprotein involved in membrane fusion 
homologous to gH in KSV1 . In EBV, this protein is 
encoded by BLXF2 ORF located -7600 bp rightward of 
BDLF1, and in HSVSA it is encoded by ORF 22 located 
-7100 bp rightward of ORF 2 6 . 

g3 5 is a late EBV glycoprotein found in virion and 
oiasma membrane. It is encoded by BDLF3 ORF which is 
1300 bp leftward of 3DLF1 m EBV. There is no 5DLF3 
homo log in HSVSA . A subcioned fragment has aireaoy 
been identified with strong homology to the EBV gp35 
ooen reading frame. 

Major capsid protein (MCP) is a conserved 150 KDa 
ororein which is the major component: of herpesvirus 
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capsid. Antibodies are generated against the MC? 
during natural infection with most herpesviruses. The 
terminal 1026 bp of this major capsid gene homolog in 
KSHV have been sequenced. 

Taraeted homologous genes/loci in regions fiar.Kinc 
KS 6 2 73am include, but are not limited to: termina- 
reiterated repeats, LM?I , E3£Rs and Ori ?. Termma. 
reiterated sequences are present in all herpesviruses. 
In E3V , randomly reiterated 0.5 Kb long terminal 
repeats flank the ends of the linear genome and become 
joined in the circular form. The terminal repeat 
region is immediately adjacent to BNRF1 in EBV and ORF 
75 in HSVSA. Since the number of terminal repeats 
varies between viral strains, identification of 
terminal repeat regions may allow typing and clonal icy 
studies of KSHV in KS legions. Sequencing through the 
terminal repeat region may determine whether this 
virus is integrated into human genome in KS . 

* 

LM?I is an latent protein important in the 
transforming effects of EBV in 3urkitt's lymphoma. 
This gene is encoded by the EBV BNRF1 ORF located 
-2000 bp rightward of tegument protein ORF 3NRF1 in 
the circularized genome. There is no LM?1 homolog in 
HSVSA . 

EBERs are the most abundant RNA in latently EBV 
infected cells and Ori-P is the origin of replication 
for latent EBV genome. This region is located between 
-4000-9000 bp 

are no corresponding regions in HSVSA 



leftward of the 3NRF1 ORF in EBV ; tnere 



The data indicates that the KS agent is a new human 
herpesvirus related to gamma herpesviruses r.3V and 
HSVSA The results are not due to contamination or to 
dental co-infection with a known herpesvirus since 
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the sequences are distinct from all sequenced 
herpesviral genomes (including EBV, CMV # KKVc and 
KSVSA) and are associated specifically with KS in 
three separate comparative studies. Furthermore, ?CR 
5 testing of KS DNA with primers specific for E3V-1 ana 

Z3V-2. failed to demonstrate these viral genomes in 
these tissues. Although KSHV is homologous to E3V 
rerions, the sequence does not match any ether known 
sequence and thus provides evidence for a new viral 
10 aenome, related to but distinct from known members of 

the herpesvirus family. 

Experiment 6 : Serological studies 

Indirect: immunofluorescence assay 1 I F A ) 

Virus -containing cells are coated to a microscope 
slide. The slides are treated with organic fixatives, 
dried and then incubated with patient sera. 
Antibodies in the sera bind to the ceils, and then 
excess nonspecific antibodies are washed off. An 
ar.tihuman immunoglobulin linked to a f iucrochrome , 
such as fluorescein, is then incubated with the 
slides, and then excess fluorescent immunoglobulin is 
washed off. The slides are then examined under a 
microscope and if the cells fluoresce, then this 
indicates that the sera contains antibodies directed 
aaainst the antigens present in the cells, such as the 
virus. 



immunofluorescence assay CFA) was 
ie Body Cz 
( 3CEL- 1 ) , which is a 



erformed on -he Body Cavity-Based Lymphoma cell line 



indirect 

naturally transformed -3V 
infected ( nonproducing ) B ceil line, using 4 KS 
patient sera and 4 control sera (from AIDS patients 
without KS) . Initially, both sets of sera showed 
similar 



o ar levels of antibody binding. To remove 
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nonspecific antibodies directed against E3V and 
lymphocyte antigens, sera at 1:25 dilution were pre- 
adsorbed using 3xl0 6 1% paraformaldehyde- fixed Raji 
cells per ml of sera. BCBL1 cells were fixed with 
5 ethanol/acetone, incubated with dilutions of patient 

sera, washed and incubated with fluorescein- cor. jugated 
goat anti-human IgG. Indirect immuncf lucres cent 

staining was determined. 



10 



2C 



Table 3 shows that unabsorbed case and control sera 
have similar end-point dilution indirect 
immunofluorescence assay ( I FA) titers against the 
3C3L1 cell line. After Raji adsorption, case sera 
have four-fold higher I FA titers against 3C3L1 cells 
than control sera. Results indicated that pre- 
adsorption against paraf ormaldehyde - f ixed Raji cells 
reduces fluorescent antibody binding in control sera 
but do not eliminate antibody binding tc KS case sera. 
These results indicate that subjects with KS have 
specific antibodies directed against the KS agent that 
can be detected in serological assays such as I FA, 
Western blot and Enzyme immunoassays (Table 3). 
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Table 3 : Indirect immunofluorescence end-point titers 
for KS case and non-KS control sera against 
the BCBL-1 cell line 

Sera No . gtarus* Pre- adsorrt ior. Pcs-.-a^crriicr. ** 

KS > 1 : 4 0 0 > J- • 4 0 0 

2 KS 1:10 0 

KS 1:200 

2 KS > 1:400 



100 
1 0 0 
200 



Control > 1:400 
Control 1:50 
Control 1:100 



e Control 1:200 



50 
50 
^ f< 

SO 



Legend Table 3 : 

20 * KS= autopsy -confirmed male, AIDS patient 

Control =autopsy- confirmed female, AIDS patient, 

no KS 

Adsorbed against RAJ I cells treated with 1% 
2= paraformaldehyde 

"mmunoblottinc f "Western blot") 

Virus-containing cells or purified virus (or a portion 
2C of the virus, such as a fusion protein) is 

eiectrophoresed on a polyacrylamide gel to separate 
the protein antigens by molecular weight. The 
proteins are blotted onto a nitrocellulose or nylon 
membrane, then the membrane is incubated in patient 
25 sera. Antibodies directed against specific antigens 

are developed by incubating with a anti-human 
immunoglobulin attached to a reporter enzyme, such as 
a peroxidase. After developing the membrane, each 
antiaen reacting against antibodies m patient sera 
4 0 shows up as a band on the membrane at the 

corresponding molecular weight region. 
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Enzyme immunoassay c pta or ELISA 1 ' ? 

Virus -containing cells cr purified virus (or a portion 
of the virus, such as a fusion protein) is coated to 
the bottom of a 96-well plate by various means 
(generally incubating in alkaline carbonate buffer) . 
The oiates are washed, then the wells are incubated 
with patient sera. Antibodies in the sera directed 
against specific antigens stick on the plate. The 
wells are washed again to remove nonspecific antiboay, 
then they are incubated with a ant i human 
immunoglobulin attached to a reporter enzyme, such as 
a peroxidase. The plate is washed again to remove 
nonspecific antibody and then developed. Wells- 
containing antigen that is specifically recognized by 
antibodies in the patients sera change color and can 
be detected by an ELISA plate reader (a 
spectrophotomer ) . 

All three of these methods can be made more specific 
by pre -incubating patient sera with uninfected cells 
:o adsorb out cross-reacting antibodies against the 
cells or against other viruses that may be present in 
rhe cell line, such as EBV . Cross - react ing antibodies 
can potentially give a falsely positive test result 
(i.e. the patient is actually not infected with the 
virus but has a positive test result because of cross- 
reacting antibodies directed against cell antigens in 
the preparation). The importance of the infection 
experiments with Raj i is that if Raji celis, or 
another well-defined cell line, can be infected, then 
the patient's sera can be pre-adsorbed against the 
uninfected parental cell line and then tested in one 
of the assays. The only antibodies left in the sera 
after pre- adsorption that bind to antigens in the 
preparation should be directed against the virus. 
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Experiment 7 : 



3CEL 1, from iymphomatous tissues belonging to a rare 
infiltrating, anaplastic body cavity lymphoma 
occurring in AIDS patients has been placed m 
„^^ nuous ce n culture and shown to be continuously 
infected with the KS agent. This cell line is also 
naturally infected with Epstein-Barr Virus ( E3V) . The 
BCBL ceil line was used as an antigen substrate to 
detect specific KS antibodies in persons infected with 
the putative virus by Western-blotting. Three 
lymphoid B cell lines were used as controls. These 
included the EBV genome positive cell line ?3K3, the 
E3V genome defective cell line Raji and the E3V genome 
negative cell line Bjab. 



Cells from late-log phase culture were washed 3 time 
with ?3S by centrif ugation at 500 g for lOmin. and 
20 suspended in sample buffer containing 5C mK Tris-HCl 

r>H S.S, 2% SDS (w/v) , 15% glycerol (v/v) , 5% 0- 
mercaptoethanoi (v/v) and 0.001% bromophenol (w/v) 
with protease inhibitor, 100 llM phenylmethylsulf onyl 
fluoride (PMSF) . The sample was boiled at 10C°C for 
25 • 5 min and centrif uged at 14 /000 g for 10 min . The 
proteins in the supernatant was then fractionated by 
sodium, . dodecyl sulf ate -polyacryiamide gel 
electrophoresis (SDS-PAGE) under reducing conditions 
with a separation gel of 15% and a stacking gel of 5% 
30 (3) , Prestained protein standards were included: 

myosin, 200 kDa ; /5-galactosidase , 115 kDA; 3SA, 78 
kDa; ovalbumin, 47.1 kDa; carbonic anhydrase , 31.4 
kDa; soybean trypsin inhibitor, 25.5 kDa, iysczyme , 
16.6 kDa and aprotinin, 8.3 kDa (Bio-Rad) . 
:mmunoblotting experiments were performed according to 
the method of Towbin et al . (4). Briefly, ^ the 
oroteins were electrophorect icaiiy transferred to 
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Hybon-C extra membranes (Pharmacia) a: 24 V fcr 7C 
min. The membranes were then dried at 3 7 °C fcr 3 0 
min, saturated with 5% skim milk in Tns-buf f ered 
saline, pH 7.4 (TBS) containing 50 mM Tris-HCI and 200 
mM NaCl, at room temperature for 1 h. The membranes 
were subsequently incubated with human sera at 
dilution 1:200 in 1% skim milk overnight at room 
temperature, washed 3 times with a solution containing 
735, 0.2% Triton X-100 and 0 . 05% skim milk and then 2 
times with TBS. The membranes were then incubated for 
2 h at room temperature with alkaline phosphatase 
conjugated goat ant i -mouse IgG + IgM + IgA (Sigma) 
diluted at 1:5000 in 1% skim milk. After repeating 
the washing, the membranes were stained with nitroblue 
tetranolium chloride and 5-bromo-4 -chloro-3 - 
indolylphosphate p-toluidine salt (Gibco BRL) . 

Two bands of approximately 226 kDa and 234 kDa were 
identified to be specifically present on the Wester- 
blot of BCBL cell lysate in 5 sera from AIDS gay man 
patients infected with KS.* These 2 bands were absent 
from the lysates of P3H3 , Raji and Bjab cell iysates . 
5 sera from AIDS gay man patients without KS and 2 
sera from AIDS woman patients without KS as well as 1 
sera from nasopharyncel carcinoma patient were not 
able to detect these 2 bands in BCBL 1, P3K3, Raji and 
Bjab cell lysates. In a blinded experiment, using the 
226 kDa and 234 kDa markers, 15 out of 16 sera from KS 
patients were correctly identified. In total, the 
226 kDa and 234 kDa markers were detected m 20 out of 
21 sera from KS patients. 

The antigen is enriched in the nuclei fraction of 
BC3L1. Enriched antigen with low background can be 
obtained by preparing nucleic from BCBC as the 
starting antigen preparation using standard, widely 
available protocols. For example, 500-750ml of BCBL- 
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at 5X10 E cells/ml can be pelleted at low speed. Tr.e 
pellet is placed in 10 niM NaCl, 10 mM Tris pK 7.5. 1 . = 
mK MgCl 2 (equi volume) * 1.0% NP-4 0 on ice for 2 0 tr.ir. 
-c lyse cells. The lysate is then spun at -500 rp- 
5 fcr'io'min. to pellet nucleic. The pellet is used m 

-he starting fraction for the antigen preparation for 

T-' r - = w- " "> -^d'jce cross- reactive 
the Western Diit. j.n-s v. — 

cytoplasmic antigens. 
1C p T .ri mm t 8; Transmission studies 

rj. jr.-°rr.ion exper iments 

3-31,1 cells were co-cultivated with Raji cell lines 
-5 separated by a 0.45 m tissue filter insert. 

Approximately, 1-2 x 10 £ BCBL1 and 2x10* Raji cells 
were co- cultivated for 2-20 days in supplemented RPMI 
alone, in 10 tig /ml 5 ' -bromodeoxyuridine (BUdR) and 0 . 6 
ug/ml 5' -flourodeoxyuridine or 20 ng/mi 12-0- 
2C retradecanoylphorbol-13 -acetate (TP A) . After 2,8,12 

cr 20 days co-cultivation, Raji cells were removed, 
wash-e and placed in supplemented RPKI 164 0 media. A 
Rail culture co-cultivated with BC3L1 in 20 ng/ml TPA 
2 days survived and has been kept in continuous 
suspension culture for >10 weeks. This cell line, 
aesianated RCC1 (Raji Co-Culture, No. 1) remains P.k 
Positive for the KS330, 34 sequence after multiple 
Passages. This cell line is identical to its parental 
Raji cell line by flow cytometry using SKA, HI, a-i and 
30 3°"k2^ lymphocyte- flow, cytometry (approximately 2%) . 

RCCl periodically undergo rapid cytolysis suggestive 
cf lytic reproduction of the agent. Thus, RCCl is a 
Ra-^i cell line newly infected with KSHV. 

3^ T h . results indicate the presence of a new human 

us. specifically a herpesvirus in KS lesions. Tne 
v-iah de=ree of association between this agent an: 
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AIDS-KS (>90%) , and the low prevalence o^ tne agent in 
non-KS tissues from immunocompromised AIDS patients, 
indicates that this agent has a causal role in AIDS-KS 
[47, 6B] . 

Fy periment 10: Isolation of KSHV 

Crude virus preparations are made from either tne 
supernatant or low speed pelleted cell fraction o: 
3C3L1 cultures. Approximately 6 5 0ml or more of log 
phase cells should be used {>5X10< cells/ml). 

For bonding whole virion from supernatant, the cell 
free supernatant is spun at 10,000 rpm in a GSA rotor 
for 10 min to remove debris. PEG- 8000 is added to 7% , 
dissolved and placed on ice for >2 . 5 hours. The PEG- 
supernatant is then spun at 10,000 xg for 30 min. 
supernatant is poured off and the pellet is dried and 
scraped together from the centrifuge bottles. The 
pellet is then resuspended in a small volume (1-2 ml) 
cf virus buffer <VB, 0.1 M NaCl , 0.01 M Tris, pH 7.5) . 
This procedure will precipitate both naked genome and 
whole virion. The virion are then isolated by 
cenrrifugation at 25,000 rpm in a 10-50% sucrose 
gradient made with VB . One ml fractions of the 
gradient are then obtained by standard techniques 
(e.g. using a f ractionator ) and each fraction is then 
tested by dot blotting using specific hybridizing 
primer sequences to determine the gradient fraction 
containing the purified virus (preparation cf the 
fraction maybe needed in order to detect the presence 
cf the virus, such as standard DNA extraction) . 

~o obtain the episomal DNA from the virus, the pellet 
cf cells is washed and pelleted in PBS, then lysed 
using hypotonic shock and/or repeated cycles of 
freezing and thawing in a small volume (<3 ml). 
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Nuclei and other cytoplasmic debris are removed by 
centrifugation at 10, OOOg for 10 min, filtration 
through a 0.45 m filter and then repeat centrifugation 
at 10, OOOg for 10 min . This crude preparation 
5 contains viral genome and soluble ceil components . 

The genome preparation can then be gently chlcrcform- 
ohencl extracted to remove associated proteins cr can 
be placed in neutral DNA buffer (1 M NaCl , 50 mK Tris, 
1C mM EDTA , pH 7.2-7.6) with 2% sodium dodecyisuif ate 
1C (SDS) and 1% sarcosyl . The genome is then banded by 

centrifugation through 10-30% sucrose gradient in 
neutral DNA buffer containing 0.15% sarcosyl at 20,000 
rpm in a 3W 27.1 rotor for 12 hours (for 4 0,000 rpm 
for 2-3 hours in an SW41 rotor) . The band is detected- 
as described above. 
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An example of the method for isolating KSHV genome 
from KSHV infected cell cultures (97 and 98) . 
Approximately 800 ml of BCBL1 cells are pelleted, 
washed with saline, and pelleted by low speed 
centrifugation. ' The cell pellet is lysed with an 
equal volume of RSB {10 mM NaCl, 10 mM Tris-HCl, 1.5 
mM MgC12, pH 7.8) with 1% NP-40 on ice for 10 minutes. 
The iysate is centrifuged at 900xg for 10 minutes to 
25 pellet nuclei. This step is repeated. To the 

supernatant is added 0.4% sodium dodecyisuif ate and 
EDTA to a final concentration of 10 mM . The 
supernatant is loaded on a 10-30% sucrose gradient in 
i.O M NaCl, ImM EDTA, 5 0mM Tris-HCl, pH 7.5. The 
gradients are centrifuged at 20,000 rpm on a SW 27.1 
rotor for 12 hours. In figure 11, 0.5 ml aiiquots of 
the gradient have been fractionated (fractions 1-62) 
with' the 30% gradient fraction being at fraction No. 
1 and the 10% gradient fraction being at fraction No. 
62. Each fraction has been dot hybridized to a 
nitrocellulose membrane and then a -P-iabeied KSHV DNA 
fragment, KS631Bam has been hybridized to the membrane 
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using standard techniques. Figure 11 shows that the 
major soiubilized fraction of the KSHV genome bands 
(i.e. is isolated) in fractions 42 through 46 cf the 
cradient with a high concentration of the genome being 
5 present in fraction 44. A second band of solubilized 

KSHV DNA occurs in fractions 2 6 through 32 . 

Experiment 11 : Purification of KSHV 

10 DNA is extracted using standard techniques from the 

RCC-1 or RCC-1 2FS cell line [27, 49, 66] . The DNA is 
tested for the presence of the KSHV by Southern 
blotting and PCR using the specific probes as 
described hereinafter. Fresh lymphoma tissue 

15 containing viable infected cells is simultaneously 

filtered to form a single cell suspension by standard 
techniques [49, 66]. The cells are separated by 
standard Ficoll -Plaque centrif ugation and lymphocyte 
layer is removed. The lymphocytes are then placed at 

20 >lx!0 € cells/ml into standard lymphocyte tissue culture 

medium, such as RMP 1640 supplemented with 10% fetal 
calf serum. Immortalized lymphocytes containing the 
KSHV virus are indefinitely grown in the culture media 
while nonimmortilized cells die during course of 

2= prolonged cultivation. 

Further, the virus may be propagated in a new cell 
line by removing media supernatant containing the 
virus from a continuously infected ceil line at a 
concentration of >lxl0 c cells/mi. The ■ media is 
centrifuged at 2000xg for 10 minutes and filtered 
through a 0.45^ filter to remove ceils. The media is 
applied in a 1 : 1 volume with cells growing at >ixl0 e 
eel Is /ml for 48 hours. The cells are washed and 
pelleted and placed in fresh culture medium, and 
tested after 14 days of growth. 
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The herpesvirus may be isolated from the cell DNA in 
the following manner. An infected cell line, which 
can be iysed using standard methods such as hyposmctic 
shocking and Dounce homogenization , is first pe__etec 
at 2000xg for 10 minutes, the supernatant is removed 
and centrifuged again at 10, OOOxg for 15 minutes tc 
remove nuclei and organelles. The supernatant is 
filtered through a 0.45^ filter and centrifuged again 
at 100, OOOxg for 1 hour tc pellet the virus. Tne 
virus can then be washed and centrifuged again at 
10C, OOOxg for 1 hour. 
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TTYPgRTMEKT^ DETAIL CTTPTTON II; 

fipmignein^ Studies : A lambda phage (KS5) from a KS 
lesion genomic library identified by positive 
5 hybridization with KS33 0Bam was digested with SariKI 

and Won I (3oehringer-Mannheim, Indianapolis IN) ; five 
fragments were gel isolated and subcloned into 
Biuescript II KS (Stratagene, La Jolla CA) . The 
entire sequence was determined by bidirectional 
10 sequencing at a seven fold average redundancy by 

primer walking and nested deletions. 

DNA sequence data were compiled and aligned using 
ALIGN t 131 -Kodak, Rochester NY) and analyzed using the 
15 Wisconsin Sequence Analysis Package Version 8-UNIX 

(Genetics Computer Group, Madison WI ) and the GRAIL 
Sequence Analysis, Gene Assembly and Sequence 
Comparison System v. 1.2 (Informatics Group, Oak Ridge 
TN) . Protein site motifs were identified using Motif 
2 0 (Genetics Computer Group, Madison WI) . 

sources o- Herpesvirus Gene Sequence Comparisons : 

Complete genomic sequences of three gammaherpesviruses 
were available: Epstein-Barr virus (E3V) , a 

25 herpesvirus of humans [4]; herpesvirus saimiri (HVS) , 

a herpesvirus of the New World monkey Saimiri sciureus 
[13 ; and equine herpesvirus 2 CEHV2 [49] ) . Additional 
thvmidine kinase gene sequences were obtained for 
al'celaphine herpesvirus 1 (AHV1 . [22] > and bovine 
30 herpesvirus 4 (BHV4 [31]). Sequences for the major 

capsid protein genes of human herpesvirus 6B and human 
herpesvirus 7 (KHV7) were from Mukai et al . [34] . The 
sources of all other sequences used are listed 
previously in McGeoch and Cook [31] and McGeoch et al . 
35 [32] . 
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pwinrronetiic Inference: Predicted amino acid sequences 
used for tree construction were based on previous 
experience with herpesviral phyiogenetic analyses 
[31] . Alignments of homologous sets of amino a::c 
5 sequences were made with the AMPS [5] and Pileup [16] 

programs. Regions of alignments that showed extreme 
divergence with marked length heterogeneity, typically 
terminal sections, were excised. Generally, positions 
in alignments that contained inserted gaps in one or 
more sequences were removed before use for tree 
construction. Phyiogenetic inference programs were 
from the Phylip set, version 3.5c [14] and from the 
GCG set [16] . Trees were built with the maximum 
parsimony (MP), neighbor joining (NJ) methods. For 
the NJ method, which utilizes estimates of pairwise 
distances between sequences, distances were estimated 
as mean numbers of substitution events per site with 
Protdist using the PAM 250 substitution probability 
matrix of Schwartz & Dayhoff [46] . Bootstrap 
analysis [15] was carried out for MP and NJ trees, 
with 100 sub-replicates of each alignment, and 
consensus trees obtained with the program Consense . 
Ir. addition the program Protmi was used to infer trees 
by the maximum likelihood (ML) method. Protmi was 
obtained form J. Adachi , Department of Statistical 
Science, The Graduate University for Advanced Study, 
Tokyo 106, Japan. Because of computational 

constraints, Protmi was used only with the 4 -species 
CS 1 ■ alignment . 
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ri^^nnhnresiB: Agarose plugs were prepared by 

^suspending BCBL-1 cells in 1% LM? agarose (Biorad, 

^cuies CA) and 0 . 9% NaCl at 42"C zo a final 

c^-tration of 2.5 x 10^ cells/mi. Solidifies 

agarose oluas were transferred into lysis buffer ( 0 . 5M 

ZD-A OH 8.0, 1% sarcosyl, proteinase K at 1 mg/ml 
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final concentration) and incubated for 24 hours. 
Approximately 1CT BCBL-1 cells were loaded ir. each 
lane. Gels were run at a gradient of 6.0 V/cm with a 
rur. tine of 28 h 28 min . on a CHEF Mapper XA pulsed 
field gel electrophoresis apparatus (Biorad, Hercuj.es 
CA) , Southern blotted and hybridized to KS5273am, 
KS3 3 03am and an E3V terminal repeat sequence [4 0] . 

Induction csf Genome Reel: ration : Late log phase 
3G3L-1 cells (5xlC E cells per mi) were incubated with 
varying amounts of 12 -O- tetradecanoylphorbcl - 13 - 
acetate (TPA, Sigma Chemical Co., St. Louis MO) for 46 
h, cells were then harvested and washed with 
phosphate-buffered saline <P3S) and DNA was isolated 
by chloroform-phenol extraction. DNA concentrations 
were determined by UV absorbance; 5 fig of whole cell 
DNA was quantitatively dot blot hybridized in 
triplicate {Manifold I, Schleicher and Schuell, 
Keene NH) . KS631Bam, EBV terminal repeat and beta- 
20 actin sequences were random-primer labeled with 3 'P 

[13] . Specific hybridization was quant it at ed on a 
Molecular Dynamics Phosphorlmager 425E. 

re II Cultures and Transmission Studies: Cells were 
25" ■ maintained at 5x10 s cells per ml in RPMI 1640 with 
20% fetal calf serum <FCS, Gibco-BRL , Gaithersburg 
MD) and periodically examined fcr continued KSHV 
infection by PCR and dot hybridization. The T cell 
line Molt -3 (a gift from Dr. Jodi Black, Centers for 
Disease Control and Prevention), Ra j i cells 
(American Type Culture Collection, Rockvilie MD) and 
RCC-1 cells were cultured in RPMI 1640 with 10% PCS. 
Owl monkey kidney cells (American Type Culture 
Collection, Rockville MD) were cultured in MEM with 
10% FCS and 1% nonessential amino acids (Gibco-BRL, 
Gaithersburg MD) . 
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To produce the RCC-1 cell line, 2x10 s Raji cells were 
cultivated with 1 . 4x10 s BCBL-1 cells in the presence 
of 20 ng/ml TP A for 2 days in chambers separated by 
Falcon 0.45 filter tissue culture inserts to 

=; prevent contamination of Raji with 3C3L-1. 

Demonstration that RCC-1 was not contaminated with 
3C3L-1 was obtained by PCR typing of KLA-DR alleles 
[27] (Raji and RCC-1 : DR/3l*0310, DR£3*02; 3C3L-1: 
DR?104,*07, Dr/34*01) and confirmed by flow cytometry 
to" determine the presence (Raji, RCC1) or absence 
(BCBL-1) of EMA membrane antigen. Clonal sublines 
of RCC-1 were obtained by dilution in 96 well plates 
to 0.1 cells/well in RPMI 1640, 20% FCS and 30% T- 
STIM culture supplement (Collaborative Biomedical 
Products, Bedford MA) . Subcultures were examined to 
ensure that each was derived from a single cluster 
of growing cells. 

In situ hybridization was performed with a 
previously described 25 bp oligomer located in ORF26 
which was 5' labeled with fluorescein (Operon, 
Alameda CA) and hybridized to cytospin preparations 
of 3C5L-1 , RCC-1 and Raji cells using the methods 
of Lungu et al . [29]. Slides were both directly 
visualized by UV microscopy and by incubating slides 
with anti-fluorescein-alkaline phosphatase (AP) - 
conjugated antibody (Boehringer-Mannheim, 
TndianaDolis IN), allowing immunohistochemical 
detection of bound probe. Positive control 

hvbridization was performed using a 26 bp TET- 
-■abeied E3V DNA polymerase gene oligomer (Applied 
Biosvstems, Alameda CA) which was visualized by UV 
m n otoscopy only and negative control hybridization 
was performed using a 25 bp 5' f luorescein-labelea 
HSVW47 gene oligomer (Operon, Alameda CA) which 
was visualized in a similar manner as the KSKV ORF26 
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probe. All nuclei of BCBL-1, RCC-1 and Raji 
appropriately stained with the EBV hybridization 
probe whereas no specific staining of the cells 
occurred after hybridization with the KSVi probe. 

The remaining suspension . cell lines used in 
transmission experiments were pelleted. and 
resuspended in 5 ml of 0 . 22 or 0 . 45 n filtered 3C3L- 
1 tissue culture supernatant for 16 h. 3C3L-1 
supematants were either from unstimulated cultures 
or from cultures stimulated with 20 ng/ml TPA. No 
difference in transmission to recipient cell lines 
was noted using various filtration or stimulation 
conditions. Fetal cord blood lymphocytes (FCBL) 
were obtained from heparinized fresh post-partum 
umbilical cord blood after separation on Ficoll- 
Paoue (Pharmacia LKB, Uppsala Sweden) gradients and 
cultured in RPMI 1640 with 10% fetal calf serum. 
Adherent recipient cells were washed with sterile 
Hank's Buffered Salt Solution (HBSS, Gibco-BRL, 
Gaithersburg MD) and overlaid with 5 ml of BCBL-1 
media supernatant. After incubation with BCBL-1 
media supernatant, cells were washed three times 
with sterile HBSS, and suspended in fresh media. 
Cells were subsequently rewashed three times every 
other day for six days and grown for at least two 
weeks prior to DNA extraction and testing. PCR to 
detect" KSHV infection was performed using nested and 
unnested primers from ORF 26 and ORF 25 as 
previously described [10, 35] . 

T^—rr Assav : AIDS-KS sera were 

obtained from ongoing cohort studies (provided by 
Drs Scott Holmberg, Thomas Spira.and Harold Jaffe, 
Centers for Disease Control, and Prevention, and 
Isaac Weisfuse, New York City Department of Health) . 
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Sera from AIDS-KS patients were drawn between 1 and 
31 months after initial KS diagnosis, sera from 
intravenous drug user and homosexual/bisexual 
controls were drawn after non-KS AIDS diagnosis, and 
= sera from HIV-infected hemophiliac controls were 

drawn at various times after KIV infection, 
immunofluorescence assays were performed using an 
ecraal volume mixture of goat anti -human igG-FITC 
conjugate (Molecular Probes, Eugene OR) and goat 
-0 anti-human IgM-FITC conjugate (Sigma Chemical Co., 

St. Louis MO) diluted 1:100 and serial dilutions of 
natient sera. End-point titers were read blindly 
■ and specific immunoglobulin binding was assessed by 
the presence or absence of a specular fluorescence 
15 pattern in the nuclei of the plated cells. To 

adsorb cross-reacting antibodies, 20 nl serum 
diluted 1:10 in phosphate-buffer saline (PBS), pH 
7.4, were adsorbed with l-3xl0 7 paraf ormaldehyde- 
rixed P3H3 cells for 4-10 h at 25° C and removed by 
low soeed centrif ugation . P3H3 were induced prior 
to fixation with 20 ng/ml TPA for 48 h . fixed with 
1% paraformaldehyde in PBS for 2 h at 4° C, and 
washed three times in PBS prior to adsorption. 



RESULTS 



c-^.t,™ An^v^* nf a ™ t ^ KSHW nNA Sequence , 
To demonstrate that KS3303am and KSS31Bam are 
genomic fragments from a new and previously 
^characterized herpesvirus, a lambda phage clone 
(KS5) derived from an AIDS-KS genomic DNA library 
was identified by hybridization to the KS3303am 
seou-nce. The KSS insert was subcloned after 
Notl/BamHI digestion into five subfragments and both 
st-ands of each fragment were sequenced by primer 
waling or nested deletion with a 7-fold average 
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redundancy. The KS5 sequence is 20,705 bp in length 
and has a G + C content of 54.0%. The 
observed/expected CpG dinucleotide ratio is 0.92 
indicating no overall CpG suppression in this 
region . 

Open reading frame (ORF) analysis identified 15 
complete ORFs with coding regions ranging from 231 
bp to 4128 bp in length, and two incomplete ORFs at 
the termini of the KS5 clone which were 13 5 and 552 
bu in length (Figure 12) . The coding probability of 
each ORF was analyzed using GRAIL 2 and 
CodonPreference which identified 17 regions having 
excellent to good protein coding probabilities. 
Each region is within an ORF encoding a homolog to 
a known herpesvirus gene with the exception of one 
ORF located at the genome position corresponding to 
ORF28 in herpesvirus saimiri (HVS) . Codon 
preference values for all of the ORFs were higher 
across predicted ORFs than in non- coding regions 
when using a codon table composed of KS5 homologs to 
the conserved herpesvirus major capsid (MCP) , 
glvcocrotein H (gK) , thymidine kinase <TK) , and the 
Putative DNA packaging protein (ORF2 9a/ORF2 9b) 
genes . 

The translated sequence of each ORF was used to 
search GenBank / EMBL databases with BLASTX and FastA 
algorithms [2, 38]. All of the putative KS5 ORFs , 
except one, have sequence and collinear positional 
homology to ORFs from gamma- 2 herpesviruses, 
especially HVS and equine herpesvirus 2 (EHV2) . 
Because of the high degree of collinearity and ammo 
acid seauence similarity between KSHV and HVS, KSHV 
ORFs have been named according to their HVS 
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positional homologs (i.e. KSHV ORF25 is namec -:er 
HVS ORF 25) . 

The KS5 sequence spans a region which includes tnree 
of the seven conserved herpesvirus gene blocks 
(Figure 14) [10]. ORFs present: in these blocks 
include genes which encode herpesvirus virion 
structural proteins and enzymes involved in DKA 
metabolism and replication. Ammo acid identities 
between KS5 ORFs and HVS ORFs range from 30% to 60%, 
with the conserved MCP ORF25 and ORF29b genes having 
the highest percentage amino acid identity to 
homologs in other gammaherpesvi ruses . KSHV ORF2 B , 
which has no detectable sequence homology to HVS or 
EBV genes, has positional homology to HVS ORF28 and 
EBV 3DLF3. ORF2B lies at the junction of two gene 
blocks (Figure 14); these junctions tend to exhibit 
areater sequence divergence than intrablock regions 
among herpesviral genomes [17] . Two ORFs were 
identified with sequence homology to the putative 
spliced protein packaging genes of HVS 
( ORF2Sa/ORF2 9b) and herpes simplex virus type 1 
(UL15) • The KS330Bam sequence is located within 
KSHV ORF26, whose HSV-1 counterpart, VP23 , is a 
minor virion structural component. 

For every KSHV homolog, the HVS ammo acid 
similarity spans the entire gene product, with the 
exception of ORF21, the TK gene. The KSHV TK 
homolog contains a proline-rich domain at its ammo 
terminus <nt 20343-19636 ; aa 1-235) that is not 
conserved in other herpesvirus TK sequences, while 
the carboxyl terminus (nt 19637-18601; aa 237-565) 
is highly similar to the corresponding regions of 
HVS, EHV2 , and bovine herpesvirus 4 (3HV4 } TK. A 
purine binding motif with a glycine-rich region 
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found in herpesviral TK genes, as well as other TK 
genes, is present in the KSHV TK homolog (GVMGVGKS; 
aa 260-267) . 

The KS5 translated amino acid sequences were 
searched against the PROSITE Dictionary of Protein 
Sites and Patterns (Dr. Amos Bairoch, University of 
Geneva, Switzerland) using the computer program 
Motifs. Four sequence motif matches were identified 
among KSHV hypothetical protein sequences. These 
matches included: (i) a cytochrome c family heme-binding 
motif in ORF3 3 (CVHCHG; aa 209-214) and ORF3 4 (CLLCHI; 
aa 257-2S1), !ii) an immunoglobulin and major 
histocompatibility complex protein signature in ORF2S 
(FICQAKH; aa 1024-1030), (iii) a mitochondrial energy 
transfer protein motif in ORF26 ( PDDITRMRV ; aa 260- 
268), and (iv) the purine nucleotide binding site 
identified in 0RF21. The purine binding motif is the 
only motif with obvious functional significance. A 
cvtosine-specific methylase motif present in HVS ORF27 
is not present in KSHV 0RF27. This motif may play a 
role in the methylation of episomal DNA in cells 
persistently infected with HVS [1] . 

^v^np^ti^ analysis nf KSHV: Amino acid sequences 
translated from the KS5 sequence were aligned with 
co—esoonding sequences from other herpesviruses. On 
the basis cf the level of conserved aligned residues and 
^e low incidence of introduced gaps, the amino acid 
alignments for ORFs 21, 22. 23, 24, 25, 26, 29a. 29b, 31 
and 34 were suitable for phylogenetic analyses. 

t 0 demonstrate the phylogenetic relationship of KSHV to 
ether herpesviruses, a single-gene comparison was maae 
*o~ OR^25 (MCP) homologs from KSS and twelve members of 
Heroesviridae (Figures 15A-15B) . The thirteen available 
MC p" amino acid sequences are large (1376 a. a. residues 
for the KSHV homolog) and alignment requirea only a low 
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level of gapping; however, the overall similarity 
between viruses is relatively low [33] . The MC? set 
gave stable trees with high bootstrap scores and 
assigned the KSHV homclog to the gamma -2 sublineage 
5 (genus Khadinovirus >. containing HVS , EKV2 and 3VK4 

[20, 23, 43]- KSHV was most closely associated with 
^ JS similar results were obtained for single-gene 
alignments of TK and UL15/ORF2 9 sets but with lower 
bootstrap scores so that among gamma- 2 herpesvirus 
-0 members branching orders for EKV2 , KVS ana KSHV were not 
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resolved . 



To determine the relative divergence between KSHV and 
ether gammaherpesviruses, alignments for the nine genes 
listed" above were concatenated to produce a combined 
aammaherpesvirus gene set (CS1) containing EBV, EHV2 , 
KVS and KSHV amino acid sequences. The total length of 
CS1 was 4247 residues after removal of positions 
containing gaps introduced by the alignment process in 
one or more of the sequences. The CS1 alignment was 
analyzed by the ML method, giving the tree shown in 
Fiaure 153 and by the KP and NJ methods used with the 
aliened herpesvirus MCP sequences. All three methods 
identified KSHV and HVS as sister groups, confirming 
that KSHV belongs in the gamma-2 sublineage with HVS as 
its closest known relative. It was previously estimated 
that divergence of the HVS and EHV2 lineages may have 
been contemporary with divergence of the primate and 
unoulate host lineages [33] . The results for the CS1 
set suggest that HVS and KSHV represent a lineage of 
primate herpesviruses and, based on the distance between 
KSHV and KVS relative to the position of SKV2 , 
divergence between HVS and KSHV lines is ancient. 

Gnomic Indies of KSHV : 

™, F electrophoresis performed on BC3L-1 ceils embeaaea 
<n aaarose olugs demonstrated the presence of a 
integrated KSHV genome as well as a high molecular 
weicht soecies (Figures 16A-16B). KS63l3am (Figure ISA) 
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and KS330Bam specifically hybridized to a single CHEF 
gel band emigrating with 270 kilobase <kb) linear DNA 
standards. The majority of hybridizing DNA was pre.en- 
a diffuse band at the well origin; a low i-tensit> 



10 



nigh molecular weight (HMW) band was also present 
immediately below the origin (Figure 16A. arrow). ,he 
sam° £ ilter was stripped and probed with an E3V termma- 
-neat seauence [40] yielding a 150-160 kb band (Figure 
.S3) corresponding to linear EBV DNA [24] . The HMK r.SV 
band may correspond to either circular or concatemenc 
E3V DNA [24] . 

-he phorbol ester TPA induces replication-competent EBV 

to enter a lytic replication cycle [49] . To determine 

if TPA induces replication of KSHV and EBV in BCBL-1 

ceils these cells were incubated with varying 

concentrations of TPA for 4B h (Figure 17). Maximum 

stimulation of EBV occurred at 20 ng/ml TPA which 

resulted in an eight -fold increase in hybridizing EBV 

nnlv a 13-14 fold increase in KSHV genome 
20 aenome . Only a i • J ■>■■* 

Abundance occurred after 20-80 ng/ml TPA incubation for 

46 h. 



^■^•ansmi ss-ion Studies: 

lit— to determining that the agent was likely to be a 
member of Herpesviridae by sequence analysis, BCBL-1 
c --m s were cultured with Raji cells, a r.or.lytic EBV 
^ans-'ormed B cell line, in chambers separated by a 0.45 
^tissue culture filter. Recipient Ra 3 i cells generally 
demonstrated rapid cytolysis suggesting transmission o. 
a cytotoxic component from the BCBL-1 cell Ime. One 
Ra ,, n4ne cultured in 10 ng/ml TPA for 2 days, unaerwent 
tial period of cytolysis before recovery and 
resumption of logarithmic growth. This cell Ime (RC - 
n , is" a monoculture derived from Raji uncontaminateo by 
BTT.-n as determined by PCR amplification oz r.^-DR 
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RCC-1 has remained positive for the KS330 :33 PCR proouct 

for >6 months in continuous culture (approximately 7 0 

massages), but KSHV was not detectable by aot or 

"southern hybridization at any time. Jr. s::u 

5 hvbriditation, however, with a 25 bp KSHV ORF26 - derives 

oligomer was used to demonstrate persistent localization 

cf KSHV DNA to RCC-1 nuclei. As indicated in .-.cures 

-i • ~* nra: -i an- 5 PT-" (from nassaae -65* 
1BA-18C, nuclei of BC3u i an- k*.^ - . 

cells had detectable hybridization with the ORF2S 
oUaomer, whereas no specific hybridization occurrec 
with narental Raji cells (Figure 18B) . KSHV sequences 
were detectable in 65% of BCBL-1 and 2.6% of RCC-1 ceils 
und »- these conditions. In addition, forty-five 
monoclonal cultures were subcultured by serial dilution 
-om RCC-1 at passage 50, of which eight (18%) clones 
„=re PCR positive by KS330 J33 . While PCR detection using 
unnested KS330 J33 primer pairs was lost by passage 15 m 
each of the clonal cultures, persistent KSHV genome was 
detected in 5 clones using two mere sensitive 
nonoverlapping nested PCR primer sets [3 3] suggesting 
that KSHV genome is lost over time in RCC-1 ana its 
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clones . 



tow b- oersistent levels of KS33C 223 PCR positivity were 
-ound for one of four Raji, one of four Bjab, two of 
-w_ e M olt-3, one of one owl monkey kidney cell lines 
and three of eight human fetal core blood lymphocyte 
,rr SL) cultures after inoculation with 0.2-0.4= y. 
filtered BCBL-1 supernatants . Among the PCR positive 
„ ul ~ u ,-~ s PCR detectable genome was lost after 2-6 weeks 
Ind" multiple washings. Five FC3L cultures developed 
cell clusters characteristic of EBV immorta.izea 



^vmohocytes and were positive for *3V by PC?, using F.BER 
Primers [23); three of these cultures were also 
35 "initially KS330 :3i positive. None of the recipient cel. 

lines had detectable KSHV genome by aot blot 
hybr i di z at i on . 
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Se^lQg^ st udies: 

xndi-ecc immunofluorescence antibody assays (IFA) were 
used co assess the presence of specific antibodies 
aaair.st the KSHV- and EBV- infected cell line BS.-C in 
-*e sera from AIDS-KS patients and control patients w.t.. 
vtv 'Section or AIDS. BHL-6 was substituted for BCBL-i 
Z- masons of convenience; preliminary studies showec 
-~ -ior^icaar differences in IFA results between EKL-c 
a _. B "^ L .-, BHL-6 have diffuse immunof luorescer.t ceil 
s-I^Ino with most KS patient and control unabsorbed 
s I ra " suaaesting nonspecific antibody binding (Figures 
, 9A- 9D)"~ After adsorption with paraf ormaldehyde- f ixea , 
Educed P3K3 (an EBV producer subline of P3J-HR1. a 
o* Dr. George Miller) to remove cross-reacting 
an- bodies against EBV and lymphocyte antigens, patient 
sera generally showed specular nuclear staining at high 
.iters while this staining pattern was absent from 
control patient sera (Figures 19B and 19D) . Staimng 
was localized primarily to the nucleus but weak 
cytoplasmic staining was also present at low sera 
dilutions . 

W-b unadsorbed sera, the initial endpoint geometric 
mea „ . iters (GKT) against BHL-6 cell antigens for the 
s^a from AIDS-KS patients (GMT-l : 1153 . range: 1:1 = 0 to 
, .-2 ^50) were higher than for sera from control, non-KS 
; a -Mnts (GMT-l = 342; range 1:50 to 1:12.150; p=0.04) 
' ( r <BU - e 13). While AIDS-KS patients and KlV-mtected 
-av'/bisexual and intravenous drug user control patients 
l a l s < ff n ir endooint titers to BKL-6 antigens 
" (G MT = 1-.1263 and GMT=1:1578. respectively), hemophilic 

/r*vr_- .1041 Boer, case 
A--S natient titers were lower (GKi 04). 

and control patient groups had eievatea IrA titers 
against the EBV infected cell line P3K3 . 

d^ Terence in endpoint GMT between case and control 
titers against BHL-6 antigens increased a.-e. a 
w ,- r = 3 ---3 After adsorption, case GMT aeclmed to 1.780 
.;-"-orr-ol GMT declined to 1:81 (0=0.00009). Similar 
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results were obtained by using BCBL-1 instead of 3KL-6 
cells, by pre-adsorbing with EBV- infected nonproducer 
Raji cells instead of P3K3 and by using sera from a 
homosexual male KS patient without HIV infection, ir. 
complete remission for KS for 9 months (EKL-6 titer 
1:450, ?3H3 titer 1:150). Paired sera taken E-I4 months 
^rior to KS onset and after KS onset were available for 
three KS patients: KS patients B and 13 had eight- foio 
rises and patient 8 had a three-fold fall in ?3H3- 
adsorbed BCBL-1 titers from pre-onset sera to post-KS 
sera . 

PTSCUSSION 

These studies demonstrate that specific DNA sequences 
found in KS lesions by representational difference 
analysis belong to a newly identified human herpesvirus. 
The current studies define this agent as a human gamma- 2 
heroesvirus that can be continuously cultured in 
naturally-transformed, EBV-coinf ected lymphocytes from 
20 AIDS-related body-cavity based lymphomas. 



2 r 



Sequence analysis of the KS5 lambda phage insert 
provides clear evidence that the KS330Bam sequence is 
part of a larger herpesvirus genome. KS5 has a 54.0% 
C-C content which is considerably higher than the 
corresponding HVS region (34.3% G + C) . While there is no 
-oG dinucieotide suppression in the KS5 sequence, the 
corresponding HVS region has a 0.3 3 expected : observed 
COG dinucleotide ratio [1] . The C P G dinucieotide 
f'reouency in herpesviruses varies .from global CpG 
suppression among gammaherpesviruses to local CpG 
suppression in the betaherpesviruses , which may result 
f- om deamination of 5 • -tnethylcytosine residues at Cpo 
s^tes resulting in TpG substitutions [21]. CpG 
suppression among herpesviruses [21. 30. 44] has been 
hypothesized to reflect co-replication of latent genome 

, . .j. i _^ i t c- hu fc ' z is unknown 

in actively dividing host cells, ~ 

whether or not KSHV is primarily maintained by a lytic 
replication cycle in vivo. 



WO 96/06159 



PCTAJS95/10194 



Id 



136 

The 20,705 bp KS5 fragment has 17 protein- coding 
regions, 15 of which are complete ORFs with 
appropriately located TATA and polyadenylat ion signals, 
and two incomplete ORFs located at the phage insert 
termini. Sixteen of these ORFs correspond by sequence 
and collinear positional homology to 15 previously 
identified herpesviral genes including the highly 
conserved spliced gene. The conserved positional and 
sequence homology for KSHV genes in this regicn are 
consistent with the possibility that the biological 
behavior of the virus is similar to that of other 
gammaherpesviruses. For. example, identification of a 
thvmidine kinase- like gene on KS5 implies that the agent 
is" potentially susceptible to TK-activated DNA 
polymerase inhibitors and like other herpesviruses 
Possesses viral genes involved in nucleotide metabolism 
and DNA replication [41] . The presence of major capsid 
protein and glycoprotein H gene homologs suggest that 
replication competent virus would produce a capsid 
20 structure similar to other herpesviruses. 

Phylogenetic analyses of molecular sequences show that 
KSKV belongs to the gamma-2 sublineage of the 
Gammaherpesvirinae subfamily, and is thus the first. 
25 human gamma-2 herpesvirus identified. Its closest known 

relative based on available sequence comparisons is HVS , 
a squirrel monkey gamma-2 herpesvirus that causes 
fulminant polyclonal T ceil lymphoproiif erat ive 
disorders in some New World monkey species. Data for 
3C the gamma-2 sublineage are sparse: only three viruses 

(KSHV, HVS and EHV2 ) can at present be placed on the 
Phvloaenetic tree with precision (the sublineage also 
contains murine herpesvirus 68 and BHV4 [33] ) - Given 
the limitation in resolution imposed by this thin 
-.5 background, KSHV and HVS appear to represent a lineage 

o' u-imate gamma-2 viruses. Previously, McGeoch et al . 
[3 3] proposed that lines of gamma-2 herpesviruses may 
have oriainated by cospeciation with the ancestors of 
th~r host species. Extrapolation of this view to KSHV 
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and HVS suggests that these viruses diverged at an 
ancient time, possibly contemporaneously with the 
divergence of the Old World and New World primate host 
lineages. Gammaherpesviruses are distinguished as a 

3 subfamily by their lymphotrophism [41] and this grouping 

is supported by phylogenetic analysis based on sequence 
da~a [33] • The biologic behavior of KSHV is consistent 
with its phylogenetic designation in that KSHV can be 
found in in vitro lymphocyte cultures and in in vivo 

.0 samples of lymphocytes [3] . 

This band appears to be a linear form of the genome 
because other "high molecular weight" bands are present 
• for both EBV and KSHV in 3C3L-1 which may represent 
circular forms of their genomes. The linear form of the 
E3V genome, associated with replicating and packaged DNA 
[41]" migrates substantially faster than the closed 
circular form associated with latent viral replication 
[24] . While the 270 kb band appears to be a linear 
form, it is also consistent with a replicating dimer 
piasmid since the genome size of HVS is approximately 
13 5 kb. The true size of the genome may only be 
resolved by ongoing mapping and sequencing studies. 

2^ Replication deficient EBV mutants are common among EBV 

s-rair.s passaged through prolonged tissue culture [23] . 
The EBV strain infecting Raji, for example, is an BALF-2 
deficient mutant [19] ; virus replication is not 
inducibile with TPA and its genome is maintained only as 
, 0 a' latent circular form [23, 33]. The EBV strain 

.o^fecunc 3CBL-1 does not appear to be replication 
deficient because TPA induces eight-fold ^easesj.n 
DNA content and has an apparent linear form on Cr^F 
electrophoresis. KSHV replication, however, is omy 
marginally induced by comparable TPA treatment 
beating either insensitivity to TPA induction or tnat 
aenome has undergone loss of genetic elements 
^eau-ed for TPA induction. Additional experiments, 
however, indicate that KSHV DNA can be pelleted ov hign 
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speed centrifugation of filtered organelle- free , DNase 
'-protected BC3L-1 cell extracts, which is consistent 
with KSHV encapsidation. 

Transmission of KSHV DNA from BCSL-1 to a variety of 
recipient cell lines is possible and KSHV DNA can be 
maintained at low levels in recipient cells for up to 70 
passages. However, detection of virus genome in 
"recipient cell lines by PCR may be due to physical 
association of KSHV DNA fragments rather than true 
infeccion . T his appears to be unlikely given evidence 
for specific nuclear localization of the ORF26 sequence 
in RCC-1. If transmission of infectious virus from 
3C3L-1 occurs, it is apparent that the viral genome 
declines in abundance with subsequent passages of 
recipient cells. This is consistent with studies of 
spindle cell lines derived from KS lesions. Spindle 
cell cultures generally have PCR detectable KSHV genome 
when first explanted, but rapidly lose viral genome 
after initial passages and established spindle cell 
cultures generally do not have detectable KSHV sequences 

[31 • 

Infections with the human herpesviruses are generally 
ubiquitous in that nearly all humans are infected by 
early adulthood with six of the seven previously 
identified human herpesviruses [42] . Universal 
infection with EBV, for example, is the primary reason 
for the difficulty in clearly establishing a causal role 
for this virus in EBV-associated human tumors. The 
serologic studies identified nuclear antigen in BC3L-1 
and BHL-6 which is recognized by sera from AIDS-KS 
natients but generally not by sera from control AIDS 
patients without KS after removal of ESV-reactive 
antibodies. These data are consistent with PCR studies 
of KS and control patient lymphocytes suggesting that 
KSHV is not ubiquitous among adult humans, but is 
specifically associated with persons who develop 
Kaposi's sarcoma. In this respect, it appears to be 
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eoidemiologicaily similar to HSV2 ratner than the ether 
known human herpesviruses. An alternative possir^ity 
is that elevated I FA titers against 3CE1-1 reflect 
disease status rather than infection with :he virus. 
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EXPERIMEl r r&T ' DETAILS SECTION III f, 

KS ? 5 ner- Pn^Ilmei:: : Cases and controls were selected 
from ongoing cohort studies based on the avai.aci-ity zz 
clinical inf crmat ion and appropriate PBMC sairoj.es. 
homosexual or bisexual men with AIDS who developed KS 
durincr their participation in prospective cohort studies 
were Identified [14-16] . Fourteen of these patients had 
paired PBMC samples collected after KS diagnosis '.median 
~-4 months) and at least four months prior to ^ KS 
diagnosis (median -13 months), while the remaining .• had 
paired PBMC taken at the study visit immediately prior 
to KS diagnosis {median -3 months) and at entry into 
their cohort study (median -51 months prior to KS 
diagnosis) . 

apmorahi 1 ^ and Homosexu al /Bisexna 1 Male Patient 
r^.r.rci Errollment : Two control groups of AIDS patients 
were examined: 23 homosexual/bisexual men with AIDS 
followed until death who did not develop KS ("high risk" 
control group) from the Multicenter AIDS Cohort Study 
[16]), and 19 hemophilic men ("low risk" control group) 
enrolled from joint projects of the National Hemophilia 
Foundation and the Centers for Disease Control and 
Prevention. Of the 16 hemophilic controls with 

available follow-up information, none are known to have 
developed KS and <2% of hemophilic AIDS patients 
historically develop KS [2]. For homosexual /bisexual 
AIDS control patients who did not develop KS , paired 
PBMC specimens were available at entry. into their cohort 
studv "(median -35 months prior to AIDS onset) and at the 
study visit immediately prior to nonKS AIDS diagnosis 
(median BKL-6 months prior to AIDS onset) . 

nvi rvrrac:.^" ™d Analyses: DNA from 1C £ -10" PBMC in 
each specimen was extracted and quant itated by 
spectrophotometry. Samples were prepared in physically 
/sorted laboratories from the laboratory where 
polymerase chain reaction (PCR) analyses were performed. 
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All samples were tested for amplif lability using primers 
specific for either the KLA-DQ locus (GK2S/GK27) or r- 
aiobin [IB] . PCR detection of KSHV DNA was perfcrrr.ee as 
previously described [7] with the following nested 
primer sets: No. 1 outer 5 ' -AGCACTCGCAGGGCAGTACG- 3 ' , 
5 ' - G ACTCTT CG CTG ATGAACTGG - 3 ' ; No . 1 inner 5 ' - 
TCCGTGTTGTCTACGTCCAG- 3 ' , 5 ' - AG CCG AAAGG ATT CC AC CAT- 3 ' ; Nc . 

2 outer 5 ' - AGGC AACGT CAG ATGTGAC - 3 ' , 5'- 
GAAA7TACCCACGAGATCGC-3' ; No. 2 inner 5'- 
CATGGGAGTACATTGTCAGGACCTC- 3 ' , 5' -GGAATTATCTCGCAGGTTGCC - 

3 ' ; No . 3 outer 5 ' -GGCGACATTCATCAACCTCAGGG- 3 ' , 5 ' - 
ATATCATCCTGTGCGTTCACGAC- 3 ' ; No. 3 inner 5'- 
CATGGGAGTACATTGTCAGGACCTC- 3 ' , 5 ' - GGAATTATCTCGCAGGTTGCC - 
3' . The outer primer set was amplified for 35 cycles at 
94° C fcr 30 seconds, 60° C for 1 minute and 72° C for 

1 minute with a 5 minute final extension cycle at 72° C. 
One to three ml of the PCR product was added to the 
inner PCR reaction mixture and amplified for 25 
additional cycles with a 5 minute final extension cycle. 
Primary determination of sample positivity was made with 
primer set No. 1 and confirmed with either primer sets 

2 or 3 which amplify nonover lapping regions of the KSHV 
hypothetical major capsid gene. Sampling two portions 
cf the KSHV genome decreased the likelihood of 
in-raexperimental PCR contamination. These nested 
rrimer sets are 2-3 logs more sensitive for detecting 
KSHV sequences than the previously published KS330 SS3 
primers [6} and are estimated to be able to detect <10 
copies of KSHV genome under optimal conditions. Sample 
preparations were prealiquoted and. amplified with 
alternating negative control samples without DNA to 
monitor and control possible contamination. Ail samples 
were tested in a blinded fashion and a determination of 
the positivity/negativity made before code breaking. 
Significance testing was performed with Mantel -Haenszel 
chi- squared estimates and exact confidence intervals 
usina Epi-lnfo ver . 6 (USD Inc., Stone Mt . GA) . 
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yq ry ppsi^^rv of r»** zmd C nr^ri ?3I«r Sar.cles : 
T>a J ~d P3MC samples were available from each KS patient 
and'homosexual/bisexual control patient; a single sample 



was ava 



liable from each hemophilic control patient. 



To determine the KSHV positivity rate for each group o: 
AIDS oatients. a single specimen from each participant 
taken' closest to KS or other AIDS-def ining illness 

("second sample") was analyzed. Overall, 12 of 21 (57%) 
of P3MC specimens from KS patients taken from 6 months 
prior to KS diagnosis to 20 months after KS diagnosis 
'were KSHV positive. There was no apparent difference in 
positivity rate between immediate pre- diagnosis and 
post-diagnosis visit specimens (4 of 7 (57%) vs. 8 of 14 

(57%) respectively). 

The number of KSHV positive control PBMC specimens from 
both homosexual/bisexual (second visit) and hemophilic 
patient controls was significantly lower. Only 2 of 19 
"(11%) hemophilic PBMC samples were positive (odds ratio 
11.3, 95 %'confidence interval 1.6 to 118) and only 2 of 
23 (9%) PBMC samples from homosexual/bisexual men who 
d'd not develop KS were positive (odds ratio 14.0, 95% 
confidence interval 2.3 to 144) . If all KS patient PBMC 
samples taken immediately prior to or after diagnosis 
„~-e truly infected, the PCR assay was at least 57% 
sensitive in detecting KSHV infection among PBMC 
samples. No significant differences in CD4 + counts were 
--ound for KS patients and homosexual /bisexual patients 
without KS at the second sample evaluation (Kruskall- 
wailis p=C15) (Figure 21) . CD4~ counts from the single 
sample 'from hemophilic AIDS patients were higher than 
CD4*. counts from KS patients (Kruskall-Kaiiis p=C.0O4). 
although both groups showed evidence of KIV- related 
immunosuppression . 
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Loncitudinal q-udies: 

Paired specimens were available from all 21 KS patients 
and 23 homosexual /bisexual male AIDS control patients 
who did not develop KS . For the KS group, initial ?3MC 

5 samples were taken four to 87 months (median 13 months) 

prior to the onset of KS . Initial P3KC samples fror. the 
control group were drawn 13 to 106 months (median 5 5 
months) prior to onset of first nonKS AIDS-def ir.ing 
illness (1987 CDC surveillance definition) . 11 or 21 
0 (52%) of KS patients had detectable KSKV DNA in P3MC 

samples taken prior to KS onset compared to 2 of 19 
(11%, p=0.005) hemophilic control samples, and 1 (4%, 
p=0.0004) and 2 (9%, p=0.002) of 23 homosexual/bisexual 
- control samples taken at the first and second visits 

.5 respectively (Figures 20A-20B) . The figure shows that 

7 of the paired KS patient samples were positive at both 
visits, 5 KS patients and 2 control patients converted 
from negative to positive and two KS patients and one 
control patient reverted from positive to negative 

20 between visits. The remaining 7 KS patients and 20 

control patients were negative at both visits. 

For the 5 KS patients that converted from an initial 
negative P3MC result to a positive result at or near to 

2 5 KS diagnosis, the median length of time between the 

first sample and the KS diagnosis was 19 months. Three 
cf the 6 KS patients that were negative at both visits 
had their last PBMC sample drawn 2-3 months prior to 
onset of niness. It is unknown whether these patients 

30 became infected between their last study visit and the 

KS diagnosis date. 

r-TSCUSSION 

Ambroziak and coworkers have found evidence that KSHV 
preferentially infects CD19+ B cells by PBMC subset 
examination of three patients [19] - Other 
gammaherpesviruses, such as Epstein-Barr virus (EBV) and 
herpesvirus saimiri are also lymphotrophic herpesviruses 
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and can cause lymphoprolif erative disorders in primates 
[II, 20] . 

It is possible chat KSHV, like mos: human herpesviruses, 
is a ubiquitous infection of adults [21]. E3V , for 
example, is aetectable by P-R — w_i_s . 
from" virtually all seropositive persons 122^ an= 
accroxima-eiy 98% MACS study participants had E3V VCA 
antibodies at entry into the cohort study [23] . Tne 
findings, however, are most consistent with control 
oa-jeats having lower KSHV infection rates than cases 
*a«d that KSHV is specifically associated with the 
subseauent development of KS . While it is possible that 
control uatients are infected but have an undetectabiy 
low KSHV viral PBMC load, the inability to find evidence 
o* infection in control patients under a variety of PCR 
conditions suggests that the majority of control 
oa-ients are not infected. Nonetheless, approximately 
*10% of these patients were KSHV infected and did not 
develoo KS. It is unknown whether or not this is 
similar to the KSHV infection rate for the general human 
copulation . 

This study demonstrates that KSHV infection is both 
c~ onaly associated with KS and precedes onset of 
~ s -ase in the majority of patients. 57% of KS patients 
had detectable KSHV infection at their second follow-up 
v<si- (52% orior to the onset of KS] compared to only 9% 
o* homosexual /bisexual (p=0.002) and 11% of hemophilic 
control patients (p=0.005) . Despite similar CD4* levels 
be-ween homosexual /bisexual KS cases and controls. KSHV 
DNA oositivity rates were significantly higher for cases 
a- both the first (p=0.005) and second sample visits 
indicating that immunosuppression alone was not 
^esoonsible for these elevated detection rates. It is 
ais"o unlikely that KSHV simply colonizes existing KS 
lesions in AIDS patients since neither patient group hac 
KS a- the time the initial sample was obtained. Five KS 
parents and two homosexual/bisexual control patients 
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converted from a negative to a positive, possibly due to 



ne 



w infection acquired during the study period. 



The findings are in contrast to PCR detection cf KSKV 
5 DNA in all 10 PBMC samples from KS patients by Ambroziak 

e- al. [19]. xt is possible that the assay was not 
sensitive enough to detect virus in all samples since it 
was required that each positive sample to be repeatedly 
positive by two independent primers in blinded PCR 
10 assays. This appears unlikely, however, given the 

sensitivity of the PCR nested primer sets. The 7 KS 
patients who were persistently negative on both paired 
samoles may represent an aviremic or low viral load 
subpopulation of KS patients. The PCR conditions test 
15 a DNA amount equivalent to approximately 2xl0 3 

lymphocytes; an average viral load less than 1 copy per 
2xl0 3 cells may be negative in the assay. Two KS 
patients and a homosexual /bisexual control patient 
initially positive for KSHV PCR amplification reverted 
2 0 to negative in samples drawn after diagnosis. These 

results probably reflect inability to detect KSHV DNA in 
peripheral blood rather than true loss of infection 
although more detailed studies of the natural history of 
infection are needed. 

2 5 

The study was designed to answer the fundamental 
ouescion of whether or not infection with KSHV precedes 
development of the KS phenotype. The findings indicate 
that there is a strong antecedent association between 
KSKV infection and KS . This temporal relationship is an 
absolute requirement for establishing that KSHV is 
central to the causal pathway for developing KS . This 
study contributes additional evidence for a possible 
causal role for this virus in the development of KS . 
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EXPERIMENTAL DETAIL S SECTION XV: 

To determine if the KHV-KS virus is also present ir. bcth 
endemic and ElV-associated KS lesions fro- African 
5 patients, formalin- fixed, paraffin-embedded tissues from 

both HIV seropositive and HIV seropositive Ugandan' KS 
patients were compared to cancer tissues from patients 
without KS in a blinded case- control study. 

c parient F^rallment: Archival KS biopsy specimens were 

selected from approximately equal numbers of KIV- 
associated and endemic HIV-negative KS patients enrolled 
in an ongoing case-control study of cancer and HIV 
infection at Makerere University, Kampala Uganda. 

5 Control tissues were consecutive archival biopsies from 

patients with various malignancies enrolled in the same 
study, chosen without prior knowledge of HIV serostatus . 
All patients were tested for HIV antibody (measured by 
Cambridge Bioscience Recombigen Elisa assay) . 
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ssue reparation: Each sample examined was from an 
individual patient. Approximately ten tissue sections 
were cut (10 micron) from each paraffin block using a 
cleaned knife blade for each specimen. Tissue sections 
were deparaf f inized by extracting the sections twice 
with 1 ml xylene for 15 min. followed by two extractions 
with 100% ethanol for 15 min. The remaining pellet was 
then resuspended and incubated overnight at 50° C in 0.5 
ml of lysis buffer (25 mM KCl , 10 mM Tris-HCl , pK 8.3, 
1.4 mM MgCl2, 0.01% gelatin, 1 mg/ml proteinase K) . DNA 
was extracted with phenol /chloroform, ethanol 
precipitated and resuspended in 10 mM Tris-HCl, 0 . 1 mM 
EDTA, pH 8-3 . 

p-r amplification: 0.2-0.4 ug of DNA was used in PCR 
reactions with KS330 Z33 primers as previously described 

[7] . The samples which were negative were retested by 
nested PCR amplification, which is approximately 10 : -10 3 

fold more sensitive in detecting KS330„ 3 sequence than 



WO 96/06159 



PCT7US95/10194 



. 3 



20 



*2 n 



the previously published KS330-„ primer set ID ■ These 
samples were tested twice and samples showing disccraant 
results were retested a third time. 51 of 74 samples 
initially examined were available for mae?enae..t 
extraction and testing at Chester Beatty Laboratories, 
London using identical nested PCR primers and conditions 
rc ensure fidelity of the PCR results. Results :rc- 
eight samples were discordant between laboratories ar.d 
were removed from the analysis as umnterpretaoie dour 
positive samples from each laboratory! . Statistical 
comparisons were made using EPI-INFO ver. 5 (USD. Stone 
Mt. GA, USA) with exact confidence intervals. 

RESULTS: 

Of 66 tissues examined, 24 were from AIDS-KS cases, 20 
were from endemic HIV seronegative KS cases, and 22 were 
from cancer control patients without KS . Seven of the 
cancer control patients were HIV seropositive and 15 
were KIV seronegative (Figure 22). Tumors examined in 
the control group included carcinomas of the breast, 
ovaries, rectum, stomach, and colon, fibrosarcoma, 
Ivmohocytic lymphomas, Hodgkin's lymphomas, 

choriocarcinoma and anaplastic carcinoma of unknown 
orimary site. The median age of AIDS-KS patients was 29 
years (range 3-50) compared to 36 years (range 3-79) for 
endemic KS patients and 38 years (range 21-73) for 
cancer controls . 



A^ong KS lesions, 3 9 of 44 (89%) were positive for 
KS330 SJ1 PCR product, including KS tissues from 22 of ^24 
(92%) KIV seropositive and 17 of 20 (65%) r.iV 
seronegative oatients . In comparison, 3 of 22 (14%) 
nonKS cancer control tissues were positive, in = lud ^9 
i c* 7 (14%) KIV seropositive and 2 of 15 (13%) hiV 
seronegative control patients (Figure 19). These 
control oatients included a 73 year old KIV seronegative 
.r.ale and a 29 year old KIV seronegative female with 
breast carcinomas, and a 36 year old KIV seropositive 
female with ovarian carcinoma. The odds ratios for 
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detecting the sequences in tissues from HIV seropositive 
and HIV seronegative cases and controls was 6c (55% 
confidence interval {95% C.I.) 3.6-3161) and 36. £ !95% 
C.I. 4.3-42B) respectively. The overall weighted 
5 Mantel-Haenzel odds ratio stratified by HIV serostatus 

was 49.2 (95% C.I. 9.1-335). KS tissues from four HIV 
seropositive children (ages 3, 5, 6, and l years) and 
four KIV seronegative children (ages 3, 4, 4, ar.o II 
vears) were all positive for KS33C 2 , ; . 

Q 

All discordant results (i.e. KSHV negative KS or KSHV 
positive nonKS cancers) were reviewed microscopically. 
All K£33C = 3 ; PCR negative KS samples were confirmed to be 
• KS. Likewise, ail KS330 : , 3 PCR positive nonKS cancers 
5 were found not to have occult KS histopathclogicaily . 

DTSCUSSION 

These results indicate that KSHV DNA sequences are found 
not only in AIDS-KS [5], classical KS [6] and transplant 
20 KS [7] but also in African KS from both HIV seropositive 

and seronegative patients. Despite differences in 
clinical and epidemiological features, KSHV DNA 
sequences are present in all major clinical subtypes of 
KS from widely dispersed geographic settings. 

2 5 

This study was performed on banked, formalin- fixed 
tissues which prevented the use of specific detection 
assays such as Southern hybridization. DNA extracted 
after such treatment is often fragmented which reduces 

30 the detection sensitivity of PCR and may account for the 

5 PCR negative KS samples found in the study. The 
results, however, are unlikely to be due to PCR 
contamination or nonspecific amplification. Specimens 
were tested blindly and a subset of samples were 

35 independently extracted and tested at a physically 

separate laboratory. Specimen blinding is essential to 
ensure the integrity of results based solely on PCR 
analyses. A subset of amplicons was sequenced and found 
to be more than 98% identical to the published KS33G :3: 



WO 96/06159 



PCT/VS95M0194 



1 5 



sequence confirming their specific na:ure anc. because 
of minor sequence variation, making ins ?oss:r:.::y cr 
contamination unlikely. 

In contrast to previous studies in North American and 
European populations, it was found 3 of 22 control 
tissues to have evidence of KSHV infection. Since these 
cancers represent a variety of tissue types, it is 
unlikely that KSHV has an eticiogic role in these 
tumors. One possible explanation for the findings is 
that these results reflect the rate of KSHV infection m 
the nonKS population in Uganda. Four independent 
controlled studies from North America [5 and9 j Europe 
[71 and Asia [8] have failed to detect evidence of KSHV 
infection in over 200 cancer control tissues, with the 
exception of an unusual AIDS-associated, body-cavity- 
based lymphoma [9] . Taken together, these studies 
indicate that DNA-based detection of KSHV infection is 
rare in most nonKS cancer tissues from developed 
countries. KSHV infection has been reported in post- 
transplant skin tumors, although well - controlled studies 
are needed to confirm that these findings are not due to 
FCR contamination [10] . Since the rate of HIV-negative 
KS is much more frequent in Uganda than the United 
2E States, detection of KSHV in control tissues from cancer 

oatients in the study may reflect a relatively high 
prevalence infection in the general Ugandan population. 

While KS is extremely rare among children in developed 
3C countries [2] , the rate of KS in Ugandan children has 

risen dramatically over the past 3 decades: age- 
standardized rates (per 100,000) for boys age 0-14 years 
were 0.25 in 1964-6B and 10.1 in 1992-93. Detection of 
KSHV crenome in KS lesions from prepubertal children 
suaaes'ts that the virus has a nonsexual mode of 
transmission among Ugandan children. That five of these 
children were 5 years old or less raises the possibility 
that the agent can be transmitted perinataiiy. Whether 
cr not immune tolerance due to perinatal transmission 
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accounts for the more fulminant form of KS occurring ir. 
African children remains to be investigated. 
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METHODS 

Parents Serum was collected from a convenience sample 
of 89 HIV-infected patients seen at several clinical 
sites in Connecticut, New York, and California. 
Demographic and clinical information was recorded on 
standardized forms which were linked to samples by a 
numerical code. Patients were classified as having KS 
if the diagnosis was histologically confirmed or, in the 
opinion of the primary clinician, the diagnosis of KS 
was unequivocal on clinical grounds. Eighty six (97%) 
were male; 90 of the 86 men (93%) were homosexual or 
bisexual. Forty seven patients, ail male, had KS . The 

characteristics of the study population are found in 

Figure 23] . 

rp ii lines The BCBL-1 line was established from an 
ATDS-associated body cavity B cell non-Hodgkm ' s 
ivmohoma [30] . Neither BCBL-1 ceils, nor the tumor from 
which they were derived, express surface immunoglonuim 
or 3 cell specific surface markers; however BCBL-1 cells 
contain immunoglobulin gene rearrangements that are 
characteristic of B cells [31] . KSHV DNA sequences can 
be detected in BCBL-1 cells by DNA representational 
difference analysis [23,32]. BCBL-1 cells also contain 
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an EBV genome detectable with several different Z3V DNA 
probes. B95-6 is an EBV producer marmoset cell line 
that can be efficiently induced into EBV lytic cycle 
aene expression by phorbol esters (TPA) [33,34] . HE 514- 
16 is an EBV containing cell line, originally from a 
Burkitt lymphoma, that is optimally inducible into EBV 
lytic cycle gene expression by n-butyrate [35,36] . B14 1 
is an EBV -negative Burkitt lymphoma cell line [3 "3 - 
B95-S, KH514-16 and BL41 do not hybridize with the KSKV 
probes. All cell lines were cultured in RPKI 164 C 
medium containing 8% fetal calf serum. 

Tmmunoblcttinc Assays Extracts of uninduced BC3L-1 
cells or 3C3L-1 cells that had been treated with 20ng/ml 
TPA and 3 mM n-butyrate for 4 8 hrs were prepared by 
sonication. HH514-16 cells, treated similarly, served 
to control for antibody reactivity to EBV polypeptides. 
Each lane of a 10% or 12% polyacrylamide gel was loaded 
with extract of 5 X 10 5 cells in SDS sample buffer; 
electrophoresis, transfer to nitrocellulose and blocking 
with skim milk followed standard protocols [38] . Sera 
were screened at 1"100 dilution. The reaction was 
developed by 1.0 fid of 12S I Staphylococcal protein A. 
Radioautographs were exposed to film for 24-4 6 hrs. 
Immunoblotting assays were performed and interpreted on 
coded sera. 



T miming f Increscent assav The antigens were BCBL-1 ceils 
that were untreated or treated with 3mM n-butyrate for 

3 0 4 8 hrs. Cells were dropped onto slides that were fixed 

in acetone and methanol. Sera were tested at 1:10 
dilution, followed by 1:30 dilution of f lucresceinated 
goat anti-human Ig. The reactivity of a serum was 
compared on untreated and n-butyrate treated BCBL-1 

35 cells. Reactivity with 30-50% of the chemically treated 

BC3L-1 cells was considered a positive reaction. All 
immunofluorescence tests were performed on coded sera. 
The two readers were blinded to disease status or 
results of immunoblotting assays. 
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RESULTS 

T r,su-ri=n of -vol* KSHV orc-eins ir. SZ^ 

ce 13. £ : initial experiments using the immur.cbiott ir.g 
technique were designed to determine whether 3=31-1 
cells expressed unique antigenic polypeptides that mignt 
be specific for KSKV infection. Since sera from KIV-l 
infected patients with or without KS would be expected 
tc contain antibodies to EBV polypeptides and since 
3C31-1 ceils are dually infected with KSKV and EBV it 
was essential to distinguish EBV polypeptides from those 
encoded or induced by KSKV. Figures 27A-27B, an 
immunoblot prepared from BCBL-1 cells reacted with a 
reference EBV antiserum, shows that BCBL-1 cells 
expressed two polypeptides, representing the latent 
nuclear antigen E3NA1 and p21, a late antigen complex 
[39] , that were present in other EBV producer cell 
lines, such as B95-B (Figure 27A) and KK514-1S (Figure 
27B and Figures 28A-28D) . When sera from patients with 
KS were used as a source of antibody they failed to 
identify in extracts from untreated BC3L-1 cells 
additional antigenic polypeptides that were not also 
seen in the E3V producer cell lines. However, if 
extracts were prepared from BCBL-1 cells that had first 
2S been treated witha combination of phorboi ester, T?A, 

and n-butyrate, KS patient sera now recognized a number 
cf novel polypeptides that were present int eh 3C3L-1 
cell line but not in standard EBV producer cell 
lines (Figure 273). The molecular weights of the most 
3 0 prominent of these many polypeptides were estimated at 

about 27 KDa, 40 KDa and 60 KDa on 10% polyacrylamide 
gels. These polypeptides were detected within 24 hrs 
after addition of the chemical inducing agents, but were 
not evident in SC3L-1 held in culture for as long as 5 
,5 davs without chemical treatment. Further experiments 

showed that n-butyrate was the chemical agent primarily 
responsible for induction of P 40, whereas ? 60 couic be 
induced by T?A or n-butyrate (Figures 2BA-28D). Since 
p2 7 , p40 and p60 were not detected 



in untreated ceils 
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men: with chemicals they likely 



ana appeared after treat 

represented lytic cycle rather than latent cycle 
polypeptides of KSHV. 

P 40 and r>fiO are KSKV specific: Figures 27A-27B shows 
that antigenic polypeptides corresponding in molecular 
weight to p4 0 were not observed in two E3V producer 
lines, 395-8 and HH514-16, that were induced into the 
E3V lytic cycle by the same chemicals or in comparably 
treated E3V-negative BL41 cells. Furthermore n-butyrate 
strongly induced expression of p40 in BC3L-1 ceils but 
had little or no effect on the level of expression of 
the E3V p21 complex in the same cells. In related 
experiments it was found that n-butyrate also induced an 
increase in the abundance of KSHV DNA and KSHV lytic 
cycle mRKA. TPA, by contrast, induced the EBV lytic 
cycle efficiently' treatment with TPA caused an increase 
in the abundance of the EBV p21 protein and minimal 
induction of KSHV p40. These findings suggested that 
latency to lytic cycle switch of the two gamma herpes 
viruses carried by BCBL-1 cells was under separate 
control and that the p40 complex was specific to the 
KSHV genome . 

25 p40 as a serologic marker for KSHV : While a few highly 

reactive sera, such as KS 01-03, (Figure 273) recognized 
multiple antigenic proteins unique to the chemically 
induced BCBL-1 cells, including p27, p60 and p40, sera 
from other patients with KS did not react with p27 or 
p60 but still recognized p4 0 (Figure 2BA and 2BB) . 
Therefore recognition of p40 was investigated as a 
serologic marker for infection with KSHV. Sera from 89 
KIV-1 infected patients from Connecticut, New York and 
California were examined for presence of antibodies to 
p40; only 3 of 42 patients (7%) without KS had 
antibodies to p40 (p<0.0001 by Chi square) . These three 
patients were homosexual or bisexual men from New York 
city. The positive and negative predictive values of 
the serologic marker for the presence of KS were 64% and 
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78% respectively. Three HIV-i inferred men from New 
York with non-Hodgkin's lymphoma bur without KS were 
non-rea:tive to the KSHV p40 antigen. Figure 23 
compares the patients with KS whose serum did or did net 
5 contain antibodies to KSHV p40. Neither CD4 cell number 

nor the extent of KS disease predicted the presence cr 
absence of a serologic response to p4C. 

Tmmunof 1 uorescence assavs : Imifenobicts showed that n- 
0 butyrate induced expression of KSHV lytic cycie 

polypeptides in BCBL-1 cells without significantly 
affecting expression of EBV polypeptides (Figure 28A) . 
Therefore it was reasoned that n-butyrate might also 
induce many more BCBL-1 cells into the KSHV lytic cycle 
IS than into the EBV lytic cycle. Using indirect 

immunofluorescence with a reference human antiserum, RM 
in Figure 2 7B, that contains antibodies to E3V but not 
KSHV there were about 2% antigen positive untreated 
3C3L-1 cells and a similar number of antigen positive 
20 BCBL ceil that had been treated with n-butyrate . Serum 

01-03 that is EBV-positive and ^S-positive (Figure 27B) 
detected 2% antigen positive ce5ls in the untreated BCBL 
population, presumably the EBV expressing ceils, while 
it detected 50% antigen positive BCBL-1 ceils that had 
25 been treated with n-butyrate. This increase in the 

number of antigen positive BCBL-1 cells among the n- 
butyrate treated population served as the basis of an 
immunofluorescence screening assay for antibodies to 
KSHV lytic cycle antigens (Figures 29A-29F) . The 
30 results of the immunofluorescence assay were nearly 

identical to the immunoblotting . assay (Figure 26). 
Among 89 sera there were only 4 (3%) that were 
discordant in the two assay*. Three sera scored 
positive by I FA and negative by immunoblotting: one was 
35 considered positive by immunoblotting and negative by 

IFA. 68% of patients with KS and 12% of HIV- 1 infected 
oatients without KS were reactive by indirect 
Immunofluorescence assay (IFA). Thus using two 

different assays, antibodies to KSHV lytic cycle 
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antigens were found 6 to 9 times more frequently among 
patients with KS than among HIV-1 infected patients 
without KS. Stated another way, among individuals who 
were seropositive to KSKV p40 32/35 (91%) had KS . Among 
5 those seropositive by the immunofluorescence assay 32/37 

(86%) had KS . Thus infection with KSHV, as defined "by 
t hess serologic markers, carries a high risk cf 
development cf KS . 

1C DISCUSSION 

The recent discovery of genetic sequences representative 
of a new human herpes virus in KS tumor tissue, taken 
tooether with past epidemiologic observations, strongly 

15 implicate this novel agent in the pathogenesis of KS . 

However, these observations, by themselves, do not 
permit the construction of a unified theory of 
pathogenesis that accounts for the many mysterious 
features of KS . For example, the relative contribution 

20 of KIV-1, other forms of immunosuppression, geographic 

factors, sex differences, the role of cytokines and 
growth factors, and the occurrence of distinct clinical 
variants must all be eventually understood. By 
identifying the infection rate in different populations 

25 a serologic marker for infection with KSKV would be 

great aid in unravelling the significance of the new 
virus in this complicated puzzle. 

One possibility is that KSKV, the putative etiologic 
3 0 agent is, like all the other human herpes viruses, a 

ubiquitous, or at least widespread virus which infects 
large segments of the human population. Individuals who 
are immunosuppressed would have a greater likelihood of 
developing disease, whereas immunocompetent individuals 
3 5 would remain healthy. This pathogenetic model is 

similar to that postulated for the role that EBV plays 
in non-Hodgkin's lymphoma or cytomegalovirus in 
retinitis in patients with AIDS. If this model is 
correct a very high proportion of the adult human 
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population might be found to be seropositive for KSHV . 
The model of a ubiquitous virus selectively causing 
disease in immunodef icient individuals does not account 
f-r classical KS affecting patients who are net 
5 immunocompromised nor does it account tcr tne 

observations that endemic KS in Africa preceded the KIV- 



13 



I epidemic. Since many African patients 
HIV-1 negative other co-factors must be implicated. 

The other possibility is that KSHV infection occurs 
selectively in the human population. Transmission may 
be promoted by sexual behavior thai: also carries a high 
risk of acquiring HIV-1. In this scenario 

seroprevalence cf KSHV would be expected to be higher in 
HIV-1 seropositive and HIV-1 seronegative homosexual men 
than in other populations. If the virus alone were 
capable of inducing disease, acquisition cf KSHV 
infection, as monitored by the presence of antibody, 
would be associated with a high rate of clinically 
evident KS . However, if KSHV infection needed to 
accompanied by other co- factors to cause disease, the 
prevalence of antibody of KSHV might be similar among 
patients with and without KS . The other co- factors 
would not be identified in a serologic test for 
antibodies to KSHV antigens. 

The findings, using tests for antibodies to KSHV lytic 
cycle antigens, are consistent with the general model in 
which infection with KSHV is infrequent but associated 
with a high rate of apparent disease. Only a few HIV-1 
infected patients without KS had antibodies to the KSHV 
lytic cycle antigens; by contrast a very high proportion 
cf HIV-1 infected men who had' clinically evident KS were 
seropositive. This finding suggests that a high 
proportion of individuals who are dually infected with 
HIV-1 and KSKV develop KS . However, another 

interpretation of the data is possible, though this 
interpretation is novel and no other examples are known 
among the human herpes virus family. Infection with 
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KSHV might: be ubiquitous, antibodies to the virus woulc 
not normally be detected in healthy infected 
individuals. Antibodies would only appear after the 
virus has been reactivated from the latent :-:c the 
lyric cycle as might occur during the course of 
immunosuppression. Thus the two serologic tests that 
are described would indicate reactivated infection out 
would not be an index of past exposure to the virus. If 
this interpretation is correct, it should be possible to 
demonstrate KSHV DNA sequences or tot isolate the virus 
from healthy individuals who are KSHV seronegative. 

Reaardless of which of these two interpretations is 
correct, the serologic studies provide a strong 
correlation between the presence of antibodies to KSHV 
lytic cycle gene products and clinical KS . Nonetheless 
there are two groups of patients whose serologic results 
require further explanation. One group consists of the 
few patients with positive serology for KSHV p40 without 
clinical KS . They may have subclinical or visceral 
disease, or they may develop KS in the future. The 
other group is the approximately 30% of patients with KS 
whose sera lacked antibody to p40. The patients with KS 
who were p4 0 seronegative were not misclassif ied since 
the diagnosis was confirmed in all of them by biopsy 
(Fiaure 25) . It is possible that the antibodies being 
measured are variable and wax and wane with time 
following infection. The appearance of antibody to p40 
may reflect the extent of lytic viral replication which 
mav vary during different phases of the disease. To 
determine whether this is true prospective studies 
including serial bleedings are required. 

P 4 0 is likely to be only one among a number of KSHV 
antigens recognized by the infected patients. Antibody 
recognition of other KSHV antigens may not be possible 
on immunoblots because they comigrate with EBV 
polypeptides, because the BC3L-1 ceils cannot be induced 
"to express these antigens, or because the antigens are 
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of low abundance or denatured on the immunobiots . .r. 
some individuals serum antibodies tc p4 0 may be consume:: 
in immune complexes with p40 antigen in the circulation. 
Thus detection of p4 0 on immunoblcts may net be c: 
optimal sensitivity. In this connection three sera 
recocnized antigens in immunofluorescence tests but did 
not react with p40 on western blots. The serologic test 
employing whole BC3L-1 ceils as antigen are clearly 
first generation assays to be improved by better 
characterization of the KSKV gene products and 
preparation of recombinant antigens. 

Lack of a serologic response to p40 could also reflect 
severely impaired humoral immunity. Although humoral 
immunity is usually relatively intact in KIV infection, 
examples of impaired antibody response have been 
described. For instance, some individuals are known to 
have impaired antibody responses to parvovirus 319(40 
and others have been observed to lose antibodies to 
hepatitis B surface antigen {41] . An association 
between the degree of immunosuppression, as monitored by 
the number of CD4 cells, and the presence or absence of 
antibody p40 among patients with KS was not found 
(Figure 25) . Furthermore all the patients with or 
without antibodies to KSKV p40 had antibodies to E3V p21 
sucraesting an intact humoral immune response. 

In these serologic studies, as in the genetic probe 
studies previously reported, KSKV infection was found in 
the majority, but not all, patients with KS . Assuming 
that methodologic explanations do not account 
exclusively for the seronegative patients, other 
pathways, in addition to infection with KSKV , may lead 
to development of KS . In fact, most data suggest that 
the pathogenesis of KS is a multifactorial process. It 
has been observed that the product of the KIV- tat gene 
stimulates growth of KS tissue culture ceils [42] and 
can induce KS-like lesions in mice [43] . These findings 
suogest a direct role for KIV- I in the pathogenesis of 
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KS, at least in HIV-infected hosts. In other settings, 
other growth factors may play a similar or complementary 
function. Interleukin- 6 and basic fibroblast growth 
factor are both known stimulate growth cf KS cells 
invirro [44] , interleukin-6 is also produced in AIDS-KS 
derived cell culture [44] . Thus, KS pathogenesis may 
involve autocrine and paracrine growth factors together 
with infection with KSKV in some patients cr with 
certain strains of HIV-l in other patients. If 
infection with KSKV is the sine qua zzon of this process 
on would expect to see evidence of KSKV infection m a - _ 
patients with KS . 

In summary, an immunoblotting and a immunofluorescence 
screening assay for detection of antibodies to lytic 
cycle antigens of KSHV is disclosed. These assays 
should permit detailed seroepidemiologic investigations 
cf KSKV. The findings support the notion of a strong 
association between infection with KSHV and the 
development of KS in HIV-infected patients. Infection 
with KSHV , as defined by these serologic assays, appears 
to carry an extremely high risk of development of 
clinical KS . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(^} APPLICANT: The Trustees of Columbia University in the Ciry c 

New York City 

L r INVENTION: UNIQUE ASSOCIATED KAPOSI'S SARCOMA VIRUS 

v--- - - SEQUENCES AND USES THEREOF 

£iii) NUMBER OF SEQUENCES: 45 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham LLP 

CB) STREET: 118 5 Avenue of the Americas 

(C) CITY : New York 

(D) STATE: New York 
{ E ) COUNTRY : U.S.A. 
(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.2 5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: White, John P. 

(B) REGISTRATION NUMBER: 28,678 

(CJ REFERENCE /DOCKET NUMBER: 45185- C-PCi / JP VMS i_ 

t -. x \ -£ - £ C OMMUN I CAT I ON I NFORMAT I ON : 

(A) TELEPHONE: (212) 278-0400 

(B) TELEFAX: (212) 391-0525 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : N 
(iv) ANTI -SENSE: N 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TCGAGTCGGA GAGTTGGCAC AGG C CTTGAG CTCGCTGTGA CGTTCTCACG GTGTTGGTTG 60 
GGATCAGCTG GTGACTCAGA CAAGTCTTGA GCTCTACAAC GTAACATACG GGCTGATGCC 12 0 

CACCCGATAC CAGAATTACG CAGTCGGCAA TTCTGTGCCC TAGAGTCACC TCAAAGAATA 180 
ATCTGTGGTG TCCAAGGGGA GGGTTCTGGG GCCGGCTACT TAGAAACCGC CATAGATCGG 240 
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GCAGGG7GGA G7AC7TGAGG AGCCGGCGGT AGG7GGCCAG GTGGGCCCGG TTACCTGG7C 3 0C 

7777GCG7GC 7GC7GGAAGC C7GC7ZAGGG ATTTrTTAAC C7CGGCC7CG G77GGACG7A 35: 

CCA7GGCAGA AGGCGG7777 GGAGCGGAC7 CGGTGGGGCG CGGCGGAGAA AAGGCC7C7G 4 2C 

7GAC7AGGGG AGGCAGG7GG GAC77GGGGA G C7CGGACGA CGAA7CAAGC A7C7CCACAA 48 C 

CCAGCACGGA 7A7GGACGAC C77CC7GAGG AG AGG AAAC C AC7AACGGGA AAG7C7G7AA 54 C 

AAACC7CG7A CATATACGAC G7GCCCACCG 7CCCGACCAG CAAG CC G7GG CA777AA7G Z 6 0 0 

ACGACAAC7C CC7C7ACGCA ACGCC7AGG7 77CCGCCCAG ACC7C7CA7A CGGCACCC77 66 C 

CCGAAAAAGG CAGCA77777 GCCAG7CGG7 TG7CAGCGAC TGACGACGAC 7CGGGAGAC7 72 C 

ACGCGCCAA7 GGA7CG777C GCC77CCAGA GCCCCAGGG7 G7G7GG7CGC CC7CCCC77C 78 0 

CG 7 C7 Z CAAA 7CACCCACC7 CCGGCAAC7A GGCCGGCAGA CGCG7CAA7G GGGGACGTGG 64 C 

G7TGGGCGGA 7C7GCAGGGA C7CAAGAGGA CCCCAAAGGG A77777AAAA ACA7C7ACCA 90 0 

AGGGGGGCAG 7C7CAAAGCC CG7GGACGCG A7GTAGGTGA CCG7C7CAGG GACGGCGGC7 96 0 

77GCC777AG 7CC7AGGGGC G7GAAA7CTG C CA7AGGG C A AAA CA77AAA TCA7GG77GG 102 0 

GGA7CGGAGA A7CA7CGG7G AC7GCTGTCC CCGTCACCAC GCAGC77A7G G7ACCGG7GC 108 0 

ACC7CA77AG AACGCC7G7G ACCG7GGAC7 ACAGGAATG7 7TA77TG777 7AC77AGAGG 114 0 

GGG7AA7GGG 7G7GGGCAAA 7CAACGC7GG 7CAACGCCG7 GTGCGGGA7C 77GCCCCAGG 12 00 

AGAGAG7GAC AAG7777CCC GAGCCCA7GG 7G7AC7GGAC GAGGGCA777 ACAGA77G77 12 6 0 

ACAAGGAAA7 77CCCACC7G A7GAAG7C7G G7AAGGCGGG AGACCCGC7G ACG7C7GCCA 132 0 

AAA7A7AC7C A7GCCAAAAC AAG777TCGC 7CCCCTTC7G GACGAACGCC ACCGC7A7CC 1380 

73CGAA7GA7 GCAGCCC7GG AACG77GGGG GTGGG777GG GAGGGGCAC7 CA77GG7G CG 144 0 

77777GA7AG GCA7C7CC7C 7CCC7AGCAG TGGTG7TCCC TCTCA7GCAC C7GAAGCACG 15 0 0 

GCCGCC7A7C 7777GA7CAC 77C777CAA7 7AC77TCCA7 CTTTAGAGCC ACAGAAGGCG 156 0 

ACG7G37CGC CA77C7CACC C7C7CCAGCG CCGAG7CG7T GCGGCGGG7C AGGGCGAGGG 16 2 0 

GAAGAAAGAA CGACGGGACG G7GGAG7AAA AC7ACA7CAG AG AA77GG CG TGGGC77A7C 16 80 

ACGCCG7G7A C7G77CA7GG A7CA7G77GC AG7ACA7CAC TG7GGAG CAG A7GG7ACAAC 174 0 

7A7GCG7ACA AACCACAAA7 A77CCGGAAA 7C7GC7TCCG CAGCGTGCGC C7GGCACACA 1800 

AG G AG G AAA C 77TGAAAAAC C7TCACGAGC AG AG CATG 77 ACC7A7GA7C ACCGG7G7AC 1860 

7GGA7CCCG7 GAGACA7CA7 CCCG7CG7GA 7CGAGC77TG C7777G77TC 77 C AC AG AG C 192 0 

7GAGAAAA77 ACAA777A7C G7AGCCGACG CGGA7AAG77 CCACGACGAC G7A7GCGGCC 198 0 

7G7GGACCGA AA7 C7ACAGG CAGA7CC7G7 CCAA7CCGGC 7A77AAACCC AGGGCCA7CA 2 04 0 

AC7GG CCAGC A77 AG AG AG C CAG7C7AAAG CAG77AA7CA C C7 AG AG GAG ACA7GCAGGG 2100 

777AGCC77C T7GGCGGCCC 7TGCA7GC7G GCGA7GCA7A 7CG77GACA7 G7GGAGCCAC 216 0 

7GGCGCG77G CCGACAACGG CGACGACAA7 AACCCGC7CC GCCACGCAGC 7CA7CAA7GG 222 0 

GAGAACCAAC CTCTCCATAG AACTGGAA77 CAACGG CAC7 AG77777T7C 7AAA77GGCA 2280 
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AAATCTGTTG AATGTGATCA CGGAGCCGGC CCTGACAGAG TTGTGGACCT CCGCCGAAGT 2 34 C 
CGCCGAGGAC CTCAGGGTAA CTCTGAAAAA GAGGCAAAGT CTTTTTTTCC CCAACAAGAC 24 OC 

AG TTGTG AT C TCTGGAGACG GCCATCGCTA TACGTGCGAG GTG "CGACGT CGTCGCAAAC 24 6 C 

TTATAACATC ACCAAGGGCT TTAACTATAG CGCTCTGCCC GGGCACCTTG GCGGATTTGG 2 52 C 

GATCAACGCG CGTCTGGTAC TGGGTGATAT CTTCGCATCA AAATGGTCGC TATTCGCGAG 2 58C 

GGACACCCCA GAGTATCGGG TGTTTT AC CC AATGAATGTC ATGGCCGTCA AG TTTT C CAT 264 0 

ATCCATTGGC AACAACGAGT CCGGCGTAGC GCTCTATGGA GTGGTGTCGG AAGATTTCGT 2 70C 

GGTCGTCACG CTCCACAACA GGTCCAAAGA GGCTAACGAG ACGGCGTCCC AT CTT CTGTT 2 76C 

CGGTZTCZCG GATTCACTGC CATCTCTGAA GGGCCATGCC ACCTATGATG AACTCACGTT 2B2C 

CGCCCGAAAC GCAAAATATG CGCTAGTGGC GATCCTGCCT AAAGA7TCTT ACCAGACACT 2 88 0 

CCTTACAGAG AATTACACTC GCATATTTCT GAACATGACG GAGTCGACGC CCCTCGAGTT 2 94 0 

CACGCGGACG AT C C AG AC CA G G AT CG TAT C AATCGAGGCC AGGCGCGCCT GCGCAGCTCA 3 000 

AGAGGCGGCG CCGGACATAT TCTTGGTGTT GTTTCAGATG TTGGTGGCAC ACTTT CTTGT 3 06 0 

TGCGCGGGGC ATTGCCGAGC ACCGATTTGT GGAGGTGGAC TGCGTGTGTC GGCAGTATGC 312 0 

GGAACTGTAT TTTCTCCGCC GCATCTCGCG TCTGTGCATG CCCACGTTCA CCACTGTCGG 3180 

GTATAACCAC ACCACCCTTG GCGCTGTGGC CGCCACACAA ATAGCTCGCG TGTCCGCCAC 3 24 0 

GAAGTTGG CC AGTTTGCC CC GCTCTTCCCA GGAAACAGTG CTGGCCATGG TCCAGCTTGG ' 3 3 00 

CGCCCGTGAT GGCGZCGTCC CTTCCTCCAT TCTGGAGGGC ATTG CTATGG TCGTCGAACA 336 0 

TATGTATACC GCCTACACTT ATGTGTACAC ACTCGGCGAT ACTGAAAGAA AATTAATGTT 34 2 0 

G G A CAT A C A C ACGGTCCTCA CCGACAGCTG CCCGCCCAAA GACTCCGGAG TATCAGAAAA 34 8 0 

GCTACTGAGA ACATATTTGA TGTTCACATC AATGTGTACC AACATAG AG C TGGGCGAAAT 3 54 0 

GATCGCCCG2 TTTTCCAAAC CGGACAGCCT TAACATCTAT AGGGCATTCT CCCCCTGCTT 36 00 

TCTAGGACTA AGGTACGATT TGCATCCAGC CAAGTTGCGC GCCGAGGCGC CGCAGTCGTC 366 0 

CGCTCTGACG CGGACTGCCG TTGCC AG AGG AACATCGGGA TTCGCAGAAT TGCTCCACGC 3 72 0 

GCTGCACCTC GAT AG CTTAA ATTTAATTCC GGCGATTAAC TGTTCAAAGA TTACAG C CG A 3 78 0 

CAAGATAATA GCTACGGTAC CCTTGCCTCA CGTCACGTAT AT CATC AG TT CCGAAG CACT 3 84 0 

CTCGAACGCT G TT GT CT A CG AGGTGTCGGA GATCTTCCTC AAGAGTG CCA TGTTT AT AT C 3 9 00 

TG CT AT C AAA CCCGATTGCT C CGG CTTTAA CTTTT CT C AG ATTGATAGGC ACATTCCCAT 3 96 0 

AGTCTACAAC ATCAG CACAC CAAGAAGAGG TTGCCCCCTT TGTGACTCTG TAATCATGAG 4 02 0 

CTACGATGAG AGCGATGGCC TGCAGTCTCT CATGTATGTC ACTAATGAAA GGGTGCAGAC 4 08 0 

CAA CCTCTTT TTAGATAAGT CACCTTTCTT TGATAATAAC AACCTACACA TTCATTATTT 414 0 

GTGGCTGAGG GACAACGGGA CCGTAGTGGA GATAAGGGGC ATGTATAGAA GACGCGCAGC 4200 

CAGTGCTTTG TTTCTAATTC TCTCTTTTAT TGGGTTCTCG GGGGTTAT CT ACTTT CTTTA 4 26 0 

CAGACTGTTT TCCATCCTTT ATTAGACGGT CAATAAAG CG TAGATTTTTA AAAGGTTTCC 4 32 0 
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TGTGCA7TCT TTTTGTATGG GCATATACTT GGCAAGAAAT CCGAGCACCT CAG AAAG TGG 438 C 

ATTGCCGTCA CAT AT CAG TT CGACCACCCC TGCACCTAGC CATGCGGCGC TTTGACGGTC 444 0 

TTTGGGGCTA C AC AT CAT AA AGTACTTTTC CATGGCTTCT ATAAGCACCT TGGAACAATC 4 5 00 

TGGGGGTTGG CGAATGGGTT CCCTAAACGG GAAATCCTCT ATGGTATTCA GGCAGAAGAC 4 55 0 

CGCGTCCTCC ACCCGACGTT TGAGTCTTTC TAG CAG AG CG CCGAAGAACT CZCGGTCGTG "4620 

TGTTTTCGCA GGGGCAAGTT CTGCGCCGTA CAGCGATGAG AAACACGACA CGATGTTTTC 4 6 8C 

CAGCCCCATG CTGCGCAGCA ACACGTGCTT CAGGAACAGG TGTTGTAGCC GG7TCA3TTT 4 74 0 

TA3CTTGGGT AGAAAAGTTA TCGAGTTGTT AGCACGCTCC ATGATGGTAA CGGTGTTGAA 4B0C 

GTCACAGACC GGGCTTTCTC CGAGTCTCGG CCGCCTGAGT CCAATCATGT AG AA CAT AG A 4 86 0 

CGCGGCCTCG TTGTCTGTGT TAAGTGACAC GATATCCCGT TCGCAAACCT GTGCGATGTT 4 52 0 

GTGTTTCAGT AT AG AT CTGG TCTGACCGGC ACGGGGTGTT ATGGGGTGAC GCGGTAAAGG 4 98 0 

CGACTCTGGG T CAAACAC CT TTATGCGGTT GGCGGCCTCG TCGATGACGA CACGCTTGTT 5 04 0 

CGCGGCGTGT ATGGGGACGC GACGGCATCC CGCTGGC AG A TCTATAATCT T AAAG TT GG T 5100 

ATAAGACTGG TCGCTCGTTA TGGCCAGCCG GCACTCCGGT AGTATCTGCG TGTCCTCGAA 516 0 

TTCGTGGCCG CGTACGACTG GCTTGGAGTG CAGGTAAACG CCAAGAGATG CGGTCTCTTC 5220 

GCCTACGCAC AAGTGGCTTC TTAACGCGTA GGGGTGCGGT GAGAG CATG A TCCGTAGCAA 52 8 0 

CGATAGTTCC GGGTGCCTAG CCGCGTAGAG TGGCAGGGTA GACGAGTCCG GAGTCCCAAA 534 0 

CTTTTCGAAC AACAGTGGCA TCGGGACTTC AGGATTAGAG ACTCCCACCA TGGCCGCC^C 54 0 0 

CGCCGGAGAG GTCAAGACGT GAAACACG CG CTCGCCTGTC GACAGGCGCG CCGCGCCCTC 54 6 0 

TACTAGACTA GCCTTCACGT CCGGAACTCG TAACAT AG CT TAG AC CAG CG GACGGACGCA 5 52 0 

ACGTACGCGG GGATCGGCTG GCGGTGTCTG CTCGTTGGAC GCGGCCGTTC GGTGGCGCCA 5 58 0 

GTGCAGGCCT AGTTTGCGAA TGGCGTGACG GACAATTTGT GG CTTT AG AG CGGCGAACCG 5640 

ATGACCCGTG GTGGCGACGA ACGAAATGAA GTTTG CATTG CGGCCCAACT CGTCTAGCCT 5700 

GGTCTTCTTG TTTCGGG CAT AGATTTTCGG GATTAGGTTA CACTTTTTAT ATCCCAGTAC 576 0 

TGCGCACTCG TGTTTGCTTT TAGTGTGACT GATTATCTTC TTTGAGAAGT CAAACAGGCC 5 82 0 

CCGGGCGGCG GCTCGCCTAA TGCAAGCCAC GTCAAGCCTG AGAAACGAAC AG CATT C C AC 5880 

CAGACACTCC AGGAACCTTT TGTGTAGCGT CTGTATTTGG GAACGGTTTC TGTGCTCAAG 5 94 0 

TAGGGAGAAT ATT CTATTTT TGTTTCCGTC GATGCGCGCG TGCTGGTCCG TGAGAATGGG 6 00 0 

CGCCAGCTCG TGGCGAATCT G TT C CACAAG AGGCTGCCCG TACACTTTAG AAATCGTGGC 6 060 

TGTCGCGGCC TTAAAC CAGG ACACGTTTAG CCCATCCTTG CTGGAGACCA CAGATGGAAA 612 0 

GTTTGTGGTC CAAAATACGT TTTTTCGCCC CATTCTCACC ATGTACTGGT TTTCCAGTCC 6180 

GTGCAGGTCC AACGTGGAGT TCCAATTTGC TATCGATACA GGAAATATGT GCCTGA7TGG 624 0 

CAGAAAGCAT TT CAG C G T AC CCATTGCGAA G AG AAAGTG C AGCATGTCCC CACTGATGTT 63 00 

GATGTTTATT GCGGTGCCTT G A CACATGTT GTCGGAAAAA AACACG CTTA 7GGTAAAAGA 6 36 0 
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AGGTTCCTTT ACGGAGTACT TTCGTATAAC AAAATTGTTG GTCAATCTGG GGATGTTTAA 

AAT AG T CTTT TGCAGGGTGT TAGGAACGTG GCAGCTTATC TTAGTGTTAA TCACCATGTT 6-16: 

GGTGTTGAAT ATGGTGATCT TGAAGTTTTC CAAACTGACG TGTTTTGTGG G TT C C AG CAT £54C 

GTCTGACACT GTAGAGCTGC CCAGAGTCCG CGCGTCCGTG GCCGCGTATC GTTGGAAGCA 6 6 C 0 

CGCCTG CAAA TTT C CTTT CA TGGCTGCTCG CCGGTCTTTC GGCGCGTACC GGATTCTTGA 6 66 0 

AAGCGTCGCC GCCAGGAGAC GCGGTGTCTC GTGGGTGCCT AAAAAGTTTG CGCAGGGGTG £~2Z< 

CAGTCCGCTG CACGAGTGGC CGATGCAGTC TGCCACTGCC ATACACATGA CGAGTCTGTA £78C 

GATGGCCGGT GTGCCCGGAT AC A CTAG ATA GTAGGTACAA TCTGGGGTAC TGACGACCAC 684? 

CCTGTATGGC TTTGGTCCGG GGTCCTTGCG TTGGATTTTT ACGTG GAG AC GGGACACGAG 6 90C 

CTGGTTTAGA GCCAGCTGAA AGCCCACCAG ATCCCGTCCG TTAAC CTTGA CGTCCTGGTG 6 9SC 

CTTACT CTGT TTCGACAGGT TCTTCAGCAC GGTGGG CAGT CGCTCTACGT TGTGAGCGAT 7 02 0 

GGCACGGCGC AG C GAG AC CA GCTCTCCGTG CCACCCCCAC GTGGCCATGA AGCTGCTGAT 708 0 

GTTAAACTTT AAAAAATGTA GCTGTGCGTC TGGGGATGCG GGTGG CATTA TTGAAAACGA 714 0 

GAGATGCTTC AGGCTCTCCA GGAGTGCAAA ATAATTTTGA TAGATTGTGG GTTGTAGACT 72 0 0 

ATGGGGCAAC ACCGCCAGAA ACG CATG AAA ACACTGTTCG AACTCCCAGA ACTCCAGGTA 72 6 0 

CCTGCACACT ATCCTGAACA TGG UTT TGTA ACATATGGTG CACGTTAGTA GCGCGGGAAG 73 2 0 

AT AC AG CG AG CGTAGCTCCC TGAATTCGCA GGGTTTATCA CAATCATCGG TAAGTTCCCA 73 8 0 

TGATCCCACC GCAGGTAGGT AGTTGTCGGT GTCTATCTGT CCGCGCGTAA ACACTCCACC 744 0 

ACCGTCAATT ATTAAAC CTT CGCCGCTGTA CCGTCGACCC ACTTTT C C C A AAAGAGTCCC 7 5 00 

TTCTTGATGT ATAAAAGGGT GGAGGCGTTC CCCCAGGAGT AGTCTGCGTA TCGCTCTG CA 7 56 0 

GGCGAAAAAG GTGGGCTCGG GCTGCATCAT CTTATCAAGA CCTTCTAAGG TCAGCTCTGC 76 2 0 

CTGCAGGTGC GAGTTGGTGG CCAGACAGCA GAATATTTCC AG CTGTGATT CCCAAGTCGC 76 80 

TTGATAACAC GTGGTCTGCG GACTCGTCGT CAGGGAGG CG CTCGGTGGCA GTAGTAGGGG 7 74 0 

GCCCTCGAGC GCTGCCATGG AGG CG AC CTT GGAGCAACGA CCTTTCCCGT ACCTCGCCAC 7800 

GGAGGCCAAC CTCCTAACGC AGATTAAGGA GTCGG CTGCC GACGGACTCT T CAAG AG CTT 7860 

TC AG CTATTG CTCGGCAAGG ACGCCAGAGA AGGCAGTGTC CGTTTCGAAG CGCTACTGGG 7920 

C G TAT AT AC C AATGTGGTGG AGTTTGTTAA GTTTCTGGAG ACCGCCCTCG CCGCCGCTTG 7980 

CGTCAATACC GAGTTCAAGG ACCTGCGGAG AATGATAGAT GGAAAAATAC AGTTTAAAAT B04 0 

TTCAATGCCC ACTATTGCCC ACGGAGACGG GAGGAGGCCC AACAAGCAGA G A C AG TAT AT 8100 

CGTCATGAAG G CTTG CAATA AG CAC CACAT CGGTGCGGAG ATTGAG CTTG CGGCCGCAGA 816 0 

CAT CG AG CTT CTCTTCGCCG AGAAAGAGAC GCCCTTGGAC TTCACAGAGT ACGCGGGTGC 822 0 

CAT CAAG ACG ATTACGTCGG CTTTGCAGTT TGGTATGGAC GCCCTAGAAC GGGGG CTAGT 828 0 

GGACACGGTT CTCGCAGTTA AACTTCGGCA CGCTCCACCC GTCTTTATTT TAAAGACGCT 8 34 0 

GGGCGATCCC GTCTACTCTG AG AGGGG CCT CAAAAAGGCC GTCAAGTCTG ACATGGTATC 84 00 
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CATGTTCAAG GCACACCTCA TAGAACATTC ATTTTTTrTA GATAAGGCCG AG CT CATG A C 64 6 C 

AAGGGGGAAG CAGTATGTCC TAACCATGCT CTCCGACATG CTGGCCGCGG TGTGCGAGGA 6 52 0 

TACCGTCTTT AAGGGTGTCA GCACGTACAC CACGGCCTCT GGGCAGCAGG TGGCCGGCGT E56C 

CCTGGAGACG ACGGACAGCG TCATGAGACG GCTGATGAAC CTGCTGGGGC AAGTGGAAAG 864 0 

TGCCATGTCC GGGCCCGCGG CCTACGCCAG CTACGTTGTC AGGGGTGCCA ACCTCGTCAC 8 70 0 

CGCCGTTAGC TACGGAAGGG CGATGAGAAA CTTTGAACAG TTTATGGCAC GCATAGTGGA 87 6 0 

CCATCCCAAC GZTZTGZCGT CTGTGGAAGG TGACAAGGCC GCTCTGGCGG ACGGACACGA 8 62 0 

C GAG ATT CAG AGAACCCGCA TCGCCGCCTC TCTCGTCAAG ATAGGGGATA AGTTTGTGGC 86 6 0 

CATTGAAAGT TTG CAG CG CA TGTACAACGA GACTCAGTTT CCCTGCCZAZ TGAACCGGCG E94C 

CATCCAGTAC AC CTATTTCT TCCCTGTTGG CCTTCACCTT CZZGTGZCZZ GCTACTCGAC 900 0 

ATCCGTCTCA GTCAGGGGCG TAGAATCCCC GGCCATCCAG T C G AC CG AG A CGTGGGTGGT 906 0 

TAATAAAAAC AACGTGCCTC TTTGCTTCGG TTACCAAAAC GCCCTCAAAA GCATATGCCA 912 0 

CCCTCGAATG CACAACCCCA CCCAGTCAGC CCAGGCACTA AACCAAGCTT TTCCCGATCC 918 0 

CGACGGGGGA CATGGGTACG GTCTCAGGTA TGAGCAGACG CCAAACATGA AC CTATTCAG 924 0 

AACGTTCCAC C AG T ATT ACA TGGGGAAAAA CGTGGCATTT GTTCCCGATG TGGCCCAAAA 93 00 

AGCGCTCGTA ACCACGGAGG ATCTACTGCA CCCAACCTCT CACCGTCTCC TCAGATTGGA 936 0 

GZTCZ&CZZC TTCTTTGATT TTTTTGTGCA CCCCTGTCCT G G AG CG AG AG GATCGTACCG 942 0 

CGCCACCCAC AGAACAATGG TTGGAAATAT ACCACAACCG CTCGCTCCAA GGGAGTTTCA 94 8 0 

GGAAAGTAGA GGGGCGCAGT TCGACGCTGT GACGAATATG ACACACGTCA TAGACCAGCT 954 0 

AACTATTGAC G T CAT AC AGG AGACGG CATT TGACCCCGCG TATCCCCTGT TCTGCTATGT 96 00 

AATCGAAGCA ATGATTCACG GACAGGAAGA AAAATTCGTG ATGAACATGC CCCTCATTGC 96 6 0 

CCTGGTCATT CAAACCTACT GGGTCAACTC GGGAAAACTG GCGTTTGTGA ACAGTTATCA 972 0 

CATGGTTAGA TTCATCTGTA CGCATATTGG G AATGGAAG C ATCCCTAAGG AGGCGCACGG 97 8 0 

CCACTACCGG AAAATCTTAG G CG AG CT CAT CGCCCTTGAG CAGGCGCTTC TCAAGCTCGC 984 0 

GGGACACGAG ACGGTGGGTC GGACGCCGAT CACACATCTG GTTTCGGCTC TCCTCGACCC 9 90 0 

GCATCTGCTG CCTCCCTTTG CCTACCACGA TGTCTTTACG GATCTTATGC AG AAGT CATC 9 96 0 

CAGACAACCC ATAAT CAAG A TCGGGGATCA AAACTACGAC AACCCTCAAA ATAGGGCGAC 1002 0 

ATT CAT C AAC CTCAGGGGTC G C ATGG AG G A CCTAGTCAAT AACCTTGTTA ACATTTACCA 10 08 0 

GACAAGGGTC AATGAGGACC ATG A C GAG AG ACACGTCCTG GACGTGGCGC CCCTGGACGA 1014 0 

GAATGACTAC AACCCGGTCC TCGAGAAGCT ATTCTACTAT GTTTTAATGC CGGTGTGCAG 102 00 

TAACGGCCAC ATGTGCGGTA TGGGGGTCGA CTATCAAAAC GTGGCCCTGA CG CTGACTTA 102 6 0 

CAACGG ZZZC GTCTTTGCGG ACGTCGTGAA CG CACAGGAT GAT ATT CT A C TGCACCTGGA 10320 

GAACGGAACC TTGAAGGACA TTCTGCAGGC AGG CGACATA CGCCCGACGG TGGACATGAT 10380 

CAGGGTGCTG TGCACCTCGT TTCTGACGTG CCCTTTCGTC ACCCAGGCCG CTCGCGTGAT 10440 
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CACAAAGCGG GACCCGGCCC AG AG TTTTG C CACGCACGAA TACGGGAAGG ATGTGGCGCA 
GACCGTGCTT GTTAATGGCT TTGGTGCGTT CGCGGTGGCG GACCGCTCTC GCGAGGCGGC 
GGAGACTATG TTTTATCCGG TACC CTTTAA CAAGCTCTAC GCTGACCCGT TGGTGGCTGC 
CACACTGCAT CCGCTCCTGC CAAACTATGT CACCAGGCTC CCCAACCAGA GAAAGGCGGT 

GGTCTTTAAC GTGCCATCCA ATCTCATGGC AGAATATGAG GAATGGCACA AGTCGCCCGT .1074 C 

CGCGGCGTAT GCCGCGTCTT GTCAGGCCAC CCCGGGCGCC ATTAG CGCCA TGGTGAG CAT 10800 

GCACCAAAAA CTATCTGCCC C C AGTTT CAT TTGCCAGGCA AAACACCGCA TGCACCCTGG 1066: 

TTTTGCCATG ACAGTCGTCA GGACGGACGA GGTTCTAGCA GAGCACATCC TATACTGCTC 1CS2C 

CAGGGCGTCG ACATCCATGT TTGTGGGCTT GCCTTCGGTG GTACGGCGCG AGGTACGTTC 10 980 

GGACGCGGTG A CTTTTG AAA TTACCCACGA GATCGCTTCC CTGCACACCG CACTTGGCTA 1104 0 

CTCATCAGTC ATCGCCCCGG CCCACGTGGC CGCCATAACT ACAGACATGG GAGTACATTG 1110 0 

TCAGGACCTC TTTATGATTT TCCCAGGGGA CGCGTATCAG GACCGCCAGC TGCATGACTA 11160 

TATCAAAATG AAAGCGGGCG TGCAAACCGG CTCACCGGGA AACAGAATGG ATCACGTGGG 1122 0 

ATACACTGCT GGGGTTCCTC G CTG CGAGAA CCTGCCCGGT TTGAGTCATG GTCAGCTGGC 112 8 0 

AACCTGCGAG ATAATTCCCA CGCCGGTCAC ATCTGACGTT G CCTATTTC C AGACCCCCAG 1134 0 

C^ZZZCCGG GGGCGTGCGG CGTCGGTCGT GTCGTGTGAT GCTTACAGTA ACGAAAGCGC 11400 

AGAGCGTTTG TTCTACGACC ATT CAAT AC C AGACCCCGCG TACGAATGCC GGTCCACCAA 114 6 0 

CAACCCGTGG GCTTCGCAGC GTGGCTCCCT CGGCGACGTG CTATACAATA TCACCTTTCG 11520 

CCAGACTGCG CTGCCGGGCA TGTACAGTCC TTGTCGGCAG TTCTTCCACA AGGAAGACAT 11560 

TATGCGGTAC AATAGGGGGT TGTACACTTT GGTTAATGAG TATTCTGCCA GGCTTGCTGG 11640 

GGCZZZZZZZ ACCAGCACTA CAGACCTCCA GTACGTCGTG GTCAACGGTA CAGACGTGTT 11700 

TTTGG AC C AG CCTTGCCATA TGCTGCAGGA GGCCTATCCC ACGCTCGCCG CCAGCCACAG 11760 

AGTTATGCTT G CCGAGTACA TGTCAAACAA GCAGACACAC GCCCCAGTAC ACATGGGCCA 11820 

GTATCTCATT GAAGAGGTGG CGCCGATGAA GAGACTATTA AAGCTCGGAA ACAAGGTGGT 11880 

GTATTAG CTA ACCCTTCTAG CGTTGGCTAG TCATGGCACT CGACAAGAGT AT AGTGG TT A 1194 0 

ACTTCACCTC CAGACTCTTC GCTGATGAAC TGGCCGCCCT TCAGTCAAAA AT AG GG AG CG 12 COO 

TACTGCCGCT CGGAGATTGC CACCGTTTAC AAAATATACA GGCATTGGGC CTGGGGTGCG 12 05 0 

TATG CTCACG TGAGACATCT CCGGACTACA TCCAAATTAT GCAGTATCTA TCCAAGTGCA 1212 0 

CACTCGCTGT CCTGGAGGAG GTTCGCCCGG ACAGCCTGCG CCTAACGCGG ATGGATCCCT 1218 0 
CTGACAACCT T C AG AT AAAA AACGTATATG CCCCCTTTTT TCAGTGGGAC AGCAACACCC 
AGCTAGCAGT GCTACCCCCA TTTTTTAGCC GAAAGGATTC CACCATTGTG CTCGAATCCA 
ACGGATTTGA CCCCGTGTTC CCCATGGTCG TGCCGCAGCA ACTGGGGCAC GCTATTCTGC 
AGCAGCTGTT GGTGTACCAC ATCTACTCCA AAATATCGGC CGGGGCCCCG GATGATGTAA 
ATATGGCGGA A CTTG AT CTA TAT AC CA C CA ATGTGTCATT TATGGGGCGC ACATATCGTC 
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TGGACGTAGA CAACACGGAT CCACGTACTG CCCTGCGAGT G CTTGACGAT CTGTCCATGT 

ACCTTTGTAT CCTATCAGCC TTGGTTCCCA GGGGGTGTCT CCGTCTGCTC ACGGCGCTCG 126 OC 

TGCGGCACGA CAGGCATCC7 CTGACAGAGG TGTTTGAGGG GGTGGTGCCA GATGAGGTGA 1266 C 

CCAGGATAGA TCTCGACCAG TTGAGCGTCC CAGATGACAT CACCAGGATG CGCGTCATGC 1272: 
TCTCCTATCT TCAGAGTCTC AGTTCTATAT TTAATCTTG G CCCCAGACTG CACGTGTATG 
CCTACTCGGC AGAGACTTTG GCGGCCTCCT GTTGGTATTC CCCACGCTAA CGATTTGAAG 
CGGGGGGGGT ATGGCGTCAT CTGATATTCT GTCGGTTGCA AGGACGGATG ACGGC7CCGT 

CTGTGAAGTC TCCCTGCGTG GAGGTAGGAA AAAAACTACC GTCTACCTGC CGGACACTGA 12 96 0 

ACCCTGGGTG G TAG AG AC CG ACGCCATCAA AGACGCCTTC CTCAGCGACG GGATCGTGGA 13C2 0 

TATGGCTCGA AAGCTTCATC GTGGTGCCCT GCCCTCAAAT TCTCACAACG GCTTGAGGAT 13060 

GGTG CTTTTT TGTTATTGTT ACTTGCAAAA TTGTGTGTAC CTAGCCCTGT TTCTGTGCCC 13140 

CCTTAATCC7 TACTTGGTAA CTCCCTCAAG CATTGAGTTT GCCGAGCCCG TTGTGGCACC 13200 

TGAGGTGCTC TTCCCACACC CGG CTGAGAT GTCTCGCGGT TGCGATGACG CGATTTTCTG 13260 

TAAACTGCCC TATACCGTGC CTATAATCAA CACCACGTTT GGACG CATTT ACCCGAACTC 13320 

T AC A C G C G AG CCGGACGGCA GGCCTACGGA TTACTCCATG GCCCTTAGAA GGGCTTTTGC 13380 

AGTTATGGTT AACACGTCAT GTG CAGGAGT GACATTGTGC CGCGGAGAAA CTCAGACCGC 1344 0 

ATCCCGTAAC CACACTGAGT GGGAAAATCT GCTGGCTATG TTTTCTGTGA TTATCTATGC 13500 

CTTAGATCAC AACTGTCACC CGGAAG CACT GTCTATCGCG AGCGGCATCT TTGACGAGCG 13 56 0 

TGACTATGGA TT ATT CAT CT CTCAGCCCCG GAGCG7GCCC TCGCCTACCC CTTGCGACGT 13 62 0 

GTCGTGGGAA GAT AT CTACA ACGGGACTTA CCTAGCTCGG CCTGGAAACT GTGACCCCTG 13680 

GCCCAATCTA TCCACCCCTC CCTTGATTCT AAATTTTAAA TAAAGGTGTG TCACTGGTTA 13 74 0 

CACCACGATT AAAAACCACT CACTGAGATG TCTTTTTAAC CG CTAAGGG A TTATACCGGG 13800 

ATTTAAAACC GCCCACTGAT TTTTTTACGC TAAGAGTTGG GTG CTTGGGG GGTTTTGCAT 13860 

TGCTCTGTTG T AAA CT AT AT AT AAG TT AAA CCAAAATTCG CAGGGAGACA AGGTGACGGT 13 92 0 

ggtg ^ gaa;:t CAGTT GAGAG TCAGAGAATA CAGTGCTAAT CAGGGTAGAT GAGCATGACT 13 980 

TTCCCCGTCT CCAGTCACCG GAG G AATGGT GGACGGCTCC GTCCTGGTGC GAATGGCCAC 14 04C 

CAAGCCTCCC GTGATTGGTC TTATAACAGT GCTCTTCCTC CTAGT CAT AG GCGCCTGCGT 14100 

CTACTGCTGC ATTCGCGTGT TCCTGGCGGC TCGACTGTGG CGCGCCACCC C A CT AG G C AG 14160 

GGCCACCGTG GCGTATCAGG TCCTTCGCAC CCTGGGACCG CAGGCCGGGT CACATGCACC 14220 

GCCGACGGTG GGCATAGCTA CCCAGGAGCC CTACCGTACA ATATACATGC CAGATTAGAA 142 8 0 

CGGGGTGTGT GCTATAATGG ATGGCTATGG GGGGGGGCTG T AG AT AATT G AG CG CTGTG C 14 34 0 

TTTTATTGTG GGGATATGGG CTTGTACATG TGTCTATCAT CGGTAGCCAT AAAATGGGCC 144 00 

ATGACAACTG CCACAAGTAA GTCGTCCGAC ATGTGCTTTT GCTTGGCGCT GTATGACTGC 14460 

CCTCCATCCC TAAGCGGGAC GCACTTGATC GCGCGGACCT GTTCTACCAG GTAGGTCACC 14 52 0 
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GGGTCAAATG ATATTTTGAT GGTGTTGGAC ACCACCG7CT GGCTGGCGCT CAGGGTGCCG 
GAGTTCAGAG CGTAGATGAA TGTCTCAAAC GCGGAGGATT TCTCGCCTCC CAACATG7AA 
ATTGGCCACT GCAGGGCGCT GCTC7TGTCA GTATAGTGTA G AAAATG TAT GGGGAGCGGG 
CATATTT CGT TAAGGACGGT TGCAATGGCC ACCCCAGAAT CTTGGCTGCT GTTGCCTTCG 
^ZCGCCGCGZ TCACGCGCTC AATTGTGGTG TGGAG CACAG CGATCGCCTT AATCATCGTG 
CATGCGCAGG ACGCTATCTC GTAAGCAGCT GCGCCAGTGA GGTCGCGCAG GAAGAAATG C 
ZZC ^ T3ZZZA ATATGAGGCT tcTGGTGGGA GTCTGAGTAC TCGTGACAAC GGCGCCCACC- 
CCAGTACCGG ACGCCTCCGT GTTGTTCGTA TACGCGGGGT CGATGTAAAC AAA C AG CTGT 
TTTCCAAGGC ACTTCTGAAC CTCCTGGGCG GTGGTGTCTA CCCGACACAT GTCAAACTGT 
GTCAGCGCTG CGTCACCCAC CACGCGGTAA AGCGTAGCAT TTGACGACGC TGCTCCCTCG 15120 
CCCATTAGTT CGGTGTCGAA TGCCCCCTCC ATAAAGAGGT TGGTGGTGGT TTTGATGGAT 151 BO 
TCGTCGATGG TGATGTACGT CGGAATGTGC AGTCTGTAAC AAGGACAGGA CACTAGTGCG 1524 0 
TCTTGCAGGT GGAAATCTTC TCGGTGGTCC GCACACACGT AACTGAC CAC ATT C AG CATC 15300 
TTTTCCTGGG CGTTCCTGAG GTTAAGCAGG AAACTCGTGG AGCGGTCTGA CGAGTTCACG 15360 
GATGATATAA ATATAAG CTT GGCGTCTTTC TGAAGCATGA AACCCAGAAT AGCCGGCAGT 
G CAT C CTTTT TAATAAAATT CGCCTCGTCT ACGTAGAGCA GGTTAAAGGT CTGTCCCCGA 
ATGCTCTGCA GACACGGAAA GACACAAAAG AGGGG CTCAT AAGCGGCTAA CAGTAAAGGA 15540 
GAGGAGGCGA ACAGTGCGTG GCTCTTGGTT CTTGGGAATA AAAGGGGGCG TGTGTGCCGA 
TCGATCGTAT GGGTGAGCCA GTGGATCCTG GACATGTGGT GAATGAGAAA GATTTTGAGG 
AGTGTGAACA ATTTTTCAGT CAACCCCTTA GGGAGCAAGT GGTCGCGGGG GTCAGGGCAC 
TCGACGGCCT CGGTCTCGCT GACTCTCTAT GTCACAAAAC AG AAAG ACT C TGCCTGCTGA 
TGGACCTGGT GGGCACGGAG TGCTTTGCGA GGGTGTG CCG CCTAGACACC GGTGCGAAAT 
GAAGAGTGTG GCGAGTCCCT TATGTCAGTT CCACGGCGTG TTTTGCCTGT ACCAGTGTCG 
CCAGTGCCTG GCATACCACG TGTGTGATGG GGGCGCCGAA TGCGTTCTCC TGCATACGCC 15960 
GGAGAGCGTC ATCTGCGAAC TAACGGGTAA CTGCATGCTC GGCAACATTC AAGAGGGCCA 
GTTTTTAGGG CCGGTACCGT ATCGGACTTT GGATAACCAG GTTGACAGGG A CG CAT AT C A 
CGGGATG CTA GCGTGTCTGA AACGGGACAT TGTGCGGTAT TTG CAGAC AT GGCCGGACAC 
CACCGTAATC GTG CAGGAAA TAGCCCTGGG GGACGGCGTC ACCGACACCA TCTCGGCCAT 
TATAGATGAA ACATTCGGTG AGTGTCTTCC CGTACTGGGG GAGGCCCAAG GCGGGTACGC 
CCTGGTCTGT AGCATGTATC TGCACGTTAT CGTCTCCATC T ATT CG A C AA AAACGGTGTA 
CAACAGTATG CTATTTAAAT GCACAAAGAA TAAAAAGTAC GACTGCATTG CCAAGCGGGT 
GCGGACAAAA TGGATGCGCA TGCTATCAAC GAAAGATACG TAGGTCCTCG CTGCCACCGT 
TTGGCCCACG TGGTGCTGCC TAGGACCTTT CTGCTGCATC ACGCCATACC CCTGGAGCCC 
GAG AT CAT CT TTTCCACCTA CACCCGGTTC AGCCGGTCGC CAGGGTCATC CCGCCGGTTG 
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G7GGTGTG7G GGAAACGTGT CCTGGCAGGG GAGGAAAACC AACTTGCGTC 7TCACCTTCT 
GG777GGCGC TTAGCCTGCC TCTGTTTT CC CACGATGGGA A CTTT CAT C C ATTTGACA7C 
7CGG7AC7GC GCA7T7CCTG CCCTGGTTCT AATCTTAG7C TTACTGTCAG ATTTCTCTAT 
CTATCTCTGG TGGTGGCTAT GGGGGCGGGA CGGAATAA7G CGCGGAGTCC GACCGTTGAC 
GGGG7A7CGC CGCCAGAGGG CGCCGTAGCC CACCCT77GG AGGAAC7GCA GAGGC7GGCG 
CG7GCTACGC CGGACCCGGC AC7CACCCG7 GGACCG7TGC AGG7CC7GAC CGGCCTTCTC 
CGCGCAGGG7 CAGACGGAGA CCGCGCCAC7 CACCACA7GG CGCTCGAGGC TCCGGGAACC 
G7GCG7GGAG AAAGCCTAGA CCCGCC7G7T TCACAGAAGG GGCCAGCGCG CACACGCCAC 1704 C 
AGGCCACC2C CCGTGCGACT GAGCTTCAAC CCCGTCAATG CCGA7GTACC CGCTACCTGG 17100 
CGAGACGCCA C7AACG7GTA C7CGGG7GC7 CCC7ACTA7G 7G7G7G7T7A CGAACGCGGT 17 ISC 
GGCCGTCAGG AAGACGACTG GCTGCCGATA CCACTGAGC7 7CCCAGAAGA GCCCGTGCCC 17220 
CCGCCA2CGG GC77AG7G7T CATGGACGAC T7G77CA7TA ACACGAAGCA GTGCGAC7TT 
GTGGACACGC TAGAGGCCGC CTGTCGCACG CAAGG CTACA CG77GAGACA GZGZZTGZZZ 
GTCGCCATTC C7CGCGACGC GGAAATCGCA GACGCAGT7A AATCGCACTT 777AGAGGCG 
TGCCTAGTGT 7ACGGGGGC7 GGCTTCGGAG GCTAGTGCCT GGATAAGAGC 7GCCACG7CC 1746 0 
CCGCCCCTTG GCCGCCACGC CTGCTGGATG GACG7G77AG GA7TA7GGGA AAGCCGCCCC 17520 
CACACTCTAG G777GGAG7T ACGCGGCG7A AAC7GTGGCG GCACGGACGG TGAC7GGT7A 175B0 
GAGATTTTAA AACAGCCCGA TGTGCAAAAG ACAG7CAGCG GGAGTCTTGT GGCATGCG7G 17 640 
ATCGTCACAC CCGCA77GGA AGCCTGGCTT GTG7TACCTG GGGG7TT7GC 7A77AAAGCC 
CGCTATA3GG CG7CGAAGGA GGA7C7GG7G T7CATTCGAG GCCGCTATGG C7AGCCGGAG 
GCGCAAACTT CGGAATTTCC 7AAACAAGGA ATG CA7ATGG AC7GT7AACC CAA7G7CAGG 17820 
G G AC 2 AT AT Z AAGGTCT7TA ACGCCTGCAC CTC7A7CTCG CCGGTG7ATG ACCCTGAGC7 178 80 
GGTAACCAGC TACGCAC7GA GCGTGCCTGC TTACAATGTG TC7G7GGC7A 7CTTG CTGCA 
TAAAGTCA7G GGACCGTGTG 7GGC7GTGGG AA7TAACGGA GAAATGATCA 7GTACGTCGT 
AAGC2AGTGT GTTTCTGTGC GGCCCGTCCC GGGGCGCGA7 GG7A7GG CGC TCATCTAC7T 
TGGACAGTTT CTGGAGGAAG CATCCGGACT GAGATTTCCC TACAT7G C7C CGCCGCCGTC 
GCGCGAACAC G7ACCTGACC TGACCAGACA AGAATTAGTT CATACCTCCC AGGTGGTGCG 
CCGCGGCGAC CTGACCAA77 GCAC7A7GGG 7CTCGAA7TC AGGAA7G7GA ACCCTTT7GT 
TTGGCTCGGG GGCGGATCGG TG7GGCTGCT G7TC7TGGGC GTGGAC7ACA 7GGCG77C7G 
TCCGGG7GTC GACGGAATGC CG7CG77GGC AAGAG7GGCC GCCCTGCTTA CCAGGTGCGA 
CCACCCAGAC 7G7GTCCAC7 GCCATGGACT CCG7GGACAC GT7 AATG 7 AT TTCGTGGGTA 
CTGTTCTGCG CAG7CGCCGG G7C7ATCTAA CATCTGTCCC TG7A7CAAA7 CA7G7GGGAC 18480 
CGGGAA7GGA G7GACTAGGG TCACTGGAAA CAGAAA7TTT CTGGGTCTTC 7G77CGA7CC 
CATTGTCCAG AG CAGGGTAA CAGCTCTGAA GATAACTAG C CACCCAACCC CCACGCACGT 
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CGAGAATGTG CTAACAGGAG TGCTCGACGA CGGCACCTTG GTGCCGTCCG TCCAAGGCAC 
CC7GGGTCCTCTTACGAATG TCTGAC7AC7 TCAGCCGC7T GCTGATATAT GAGTGTAAAA 
AAC7TAAGGG CCTGGGCTTA CGTTCTTATT GAAGCATGTT GCGGACATCA G7GAGCTGGA 
CCGTCCTCCG GGTCGCGTGT AGA7TATGGT TCCGTTCTCC T7C7TGATG7 77AAA77777 
GGGGGGGAAC CACCGACAAA GCGTCT7TA7 GATTTCCGCG AACACGGAGT TGGCTACGTG 16 900 
C777TGGTGG G7TACGTACC CAATG7TAAT GT7C7C7ACG GATGCCAGTA GZA7G77GA7 
GATCGCCACC ACTATCCATG TC7TTCCG7G TCTCCTTGGT A7TAGGAATA CGCT7GC777 
T7GCT7AAAC G7C7GTAAAA CACTG77TGG AGT7TGAAA7 AAACCGAAGT ACTG C77AAA 
CAA7CCAAA7 AACTGGTGCG T7777TGTGG GGCCTTGATT GAAAGCAAAA AGAAAAAAGT 
GTGCA7TAC7 AGCTGCTGT7 GGAAGGGCTC CAGCCAGTGC ACCCCGGGAA CGTAA3AGCC 
G77GAGAAAG GACGAAAGGT T AAC C AG AAA AGCCTGAAGT TCGCGGTAGA CAGAG CAGGC 
GTGCAGGGAG TCGTGTGTTT T7CTGCCCGC CTGGTACTCG ACCAGTTGAT CGGCCGTGGA 
GACGTGCGCG TCC7CGCGCA CACACCGCAT CTGCAAGTAT G7TGATAGGG AC7CCAA7AG 
GCGCGGC777 GCGGGGACGT TGTCCTCGGA CGGTCTGGGG GTTCCCACGT CGGGAT7TGC 
TGACG7GGGC GTGGCGGGAT GGTGCCGTGT GCAGTATGTT TCCAGGACCG AACTGTATGA 
GTT7A77C7G TGCACCACGC CAATAAAAGG GTGCGCCATC CGTGCCGTTT TGGGACAGTG 
TCGCGTGAA7 GTCGGGGCAC TCAGTTCCCA CCTCTCTCCG GCGTCTTTGG CGG7C7CCTC 
CAGG77GGCG GCAAGGCGCT CCCTGTGACG GCTGAGCAGC ATGTTTGCTT TGAGCTCGC7 
CG7GTCCGAG GGTGACCCGG AGGTGACCAG TAGGTACGTC AAGGGCGTAC AACT7GCCC7 
GGACC77AG- GAGAAdACAC CTGGACAAT7 TAAGTTGATA GAAACTCCCC TGAAGA3C7T 
C77C77GG77 TCGAACGTGA TGCCCGAGGT CCAGCCAATG TGCAGTGGCC GGCCGGCC77 
GGGGGCAGAr 777AGTAA7 Z TCCAC7TGCC TAGACTGGAG AAGCTCCAGA GAG7CG7CGG 
GCAGGGTTTC GGGGCGGCGG GTGAGGAAAT CGCACTGGAC CCG7C7CACG TAGAAACACA 
CGAAAAGGGC CAGGTGTTCT ACAACCAC7A TGC7ACCGAG GAG7GGACG7 GGGC7T7GAC 
TCTGAATAAG GATGCGCTCC T7CGGGAGGC TGTAGATGGC C7G7GTGACC CCGGAACTTG 
GAAGGG7C7T C7TCC7GACG ACCCCCTTCC G77GCTATGG CTGC737TCA AGGGAGCCGC 
G7C77777GT CGGGCCGAC7 G77GCCTGTA CAAGCAG GAC TGCGGTTACC CGGGCCCGG7 
GC7AC77CCA GG7CACATG7 ACGC7CCCAA ACGGGATCT7 7TGTCGTTCG T7AATCATGC 
CCTGAAGTA- ACCAAG777C TATACGGAGA T77T7CCGGG ACA7GGGCGG CGGC7TGCCG 
CCCG3GA77C GCTAC7TC7C GGA7ACAAAG GG7AGTGAG7 CAGATGAAAA 7CA7AGATGC 
T7CCGACAGT 7 A C A77T CCZ AGACCTGGC7 C7TGTGTCAC A7 A7A7CAG Z AAAA7AGCAT 
AA7TGCGGGT CAGGGGACCC ACG7GGG7GG AA7CCTACTG 7TGAGTGGAA AAGGGACCCA 
G 7 AT AT AA CA GGCAATGTTC AG AC C CAAAG G7GTCCAAC7 ACGGGGGACT A7C7AA7CAT 
CCCA7CGTA7 GACATACCGG CGA7CA7CAC CA7GA7CAAG GAGAATGGAC 7GAACCAAC7 
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CTAAAAGAGA GTTTATTAAG TCGGCTCTGG AGGCCAACAT CAACAGGAGG GCAGCTGTAT 2 070C 

2 0 T 1 Z 

CGCTATTTGA 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4131 base pairs 
{3) TYPE: nucleic acid 
(C) STRANDEDNESS : Single 
{DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) Aim -SENSE: N 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .4131 
CD) OTHER INFORMATION: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

A^G GAG GCG ACC TTG GAG CAA CGA CCT TTC CCG TAC CTC GCC ACG GAG 

Me- Glu Ala Thr Leu Glu Gin Arg Pro Phe Pro Tyr Leu Ala Thr Glu 

1 5 10 15 

G~~ AA~ CTC CTA ACG CAG ATT AAG GAG TCG GCT GCC GAC GGA CTC TTC 

A^a Asn Leu Leu Thr Gin lie Lys Glu Ser Ala Ala Asp Gly Leu Phe 

20 25 30 

AAG AG " TTT CAG CTA TTG CTC GGC AAG GAC GCC AGA GAA GGC AGT GTC 

Lvs Ser Phe Gin Leu Leu Leu Gly Lys Asp Ala Arg Glu Gly Ser Va. 

35 40 45 

„ G „^ G „ g -.3 CTA CTG GGC GTA TAT ACC AAT GTG GTG GAG TTT GTT 192 

Arc Phe" Glu Ala Leu Leu Gly Val Tyr Thr Asn Val Val Glu Phe Va: 

55 60 



3 'w 



CT3 GAG ACC GCC CTC GCC GCC GCT TGC GTC AAT ACC GAG Al C 

-vs'Ph- L~u Glu Thr Ala Leu Ala Ala Ala Cys Val Asn Thr Glu Phe 
~£ 5 70 75 80 

AAG GAC CTG CGG AGA ATG ATA GAT GGA AAA ATA CAG TTT AAA ATT TCA 
*vs Asp Leu Arg Arg Me: lie Asp Gly Lys lie Gin Phe Lys ae Ser 
85 9° 95 

- -~ - — - A . — a — " GCC CAC GGA GAC GGG AGG AGG CCG AAC AAG CAG AGA 
Me- P-o Thr lie Ala His Gly Asp Gly Arg Arg Pro Asn Lys Gin Arg 
100 105 110 

CAG TAT ATC GTC ATG AAG GCT TGC AAT AAG CAC CAC ATC GGT GCG GAG 
G'n Tv- T le Val Met Lys Ala Cvs Asn Lys Kis His G±\ A^a G^u 

-* : 15 120 125 

A*^ GAG C-T GCG GCC GCA GAC ATC GAG CTT CTC TTC GCC GAG AAA GAG 
Tie Glu Leu Ala Ala Ala Asp lie Glu Leu Leu Phe Ala G^u Lys G^u 
130 135 140 

ACG CCC TTG GAC TTC ACA GAG TAC GCG GGT GCC ATC AAG ACG ATT ACG 



48 



96 



144 



240 



286 



336 



3B4 



432 



480 



Thr Pro Leu Asp Pne Thr Glu Tyr Ala Gly Ala lie Lys Thr He Thr 
145 150 155 1" 
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TCG GCT TTG CAG TTT GGT ATG GAC GCC CTA GAA CGG GGG CTA GTG GAC 5 2 E 

S*— Ala Leu Gin Phe Gly Met Asp Ala Leu Glu Arc Gly Leu Val Asp 

155 170 175 

ACG GTT CTC GCA GTT AAA CTT CGG CAC GCT CCA CCC GTC TTT ATT TTA 5 76 

th^ val Leu Ala Val Lvs Leu Arg His Ala Pro Pro Val Phe He Leu 

190 " 185 190 

AAG ACG CTG GGC GAT CCC GTC TAC TCT GAG AGG GGC CTC AAA AAG GCC 624 

Lvs Thr Leu Gly Asp Pro Val Tyr Ser Glu Arg Gly Leu Lys Lys Ala 
295 200 2C5 

GTZ AAG TCT GAC ATG GTA TCC ATG TTC AAG GCA CAC CTC ATA GAA CAT 6 72 

Va* Lvs Ser Asd Men Val Ser Met Phe Lys Ala His Leu He G-u His 
210 ' 215 22C 



TCA TTT TTT CTA GAT AAG GCC GAG CTC ATG ACA AGG GGG AAG CAG TAT 
So- ph» ^h~ Leu Asd Lvs Ala Glu Leu Met Thr Arg Gly Lys Gin Tyr 
225 * 230 235 240 



GTC TTT AAG GGT GTC AGC ACG TAC ACC ACG GCC TCT GGG CAG CAG GTG 

Val Phe Lvs Gly Val Ser Thr Tyr Thr Thr Ala Ser Gly Gin Gin Val 

260 265 270 

GCC GGC GTC CTG GAG ACG ACG GAC AGC GTC ATG AGA CGG CTG ATG AAC 

Ala Gly Val Leu Glu Thr Thr Asp Ser Val Met Arg Arg Leu Met Asn 
275 280 285 

CTG CTG GGG CAA GTG GAA AGT GCC ATG TCC GGG CCC GCG GCC TAC GCC 

Leu Leu Giv Gin Val Glu Ser Ala Met Ser Gly Pro Ala Ala Tyr Ala 
290 " 295 300 



CCC AAC GCT CTG CCG TCT GTG GAA GGT GAC AAG GCC GCT CTG GCG GAC 
Pro A sr. Ala Leu Pro Ser Val Glu Gly Asp Lys Ala Ala Leu Ala Asp 
340 345 350 



CAC GAC GAG ATT CAG AGA ACC CGC ATC GCC GCC TCT CTC GTC AAG 

Glv His Ast Glu He Gin Arg Thr Arg lie Ala Ala Ser Leu Val Lys 
33= 360 365 ■ 

ATA GGG GAT AAG TTT GTG GCC ATT GAA AGT TTG CAG CGC ATG TAC AAC 

~ie Cv Asp Lvs Phe Val Ala He Glu Ser Leu Gin Arg Met Tyr Asn 
370 * 375 360 

GAG AC" CAG TTT CCC TGC CCA CTG AAC CGG CGC ATC CAG TAC ACC TAT 

G'u T"" G~"i Phe Pro Cys Pro Leu Asn Arg Arg He Gin Tyr Thr Tyr 

38 " 5 390 395 400 

"TC TTC CCT GTT GGC CTT CAC CTT CCC GTG CCC CGC TAC TCG ACA TCC 

Phe Phe Pro Val Gly Leu His Leu Pro Val Pro Arg Tyr Ser Thr Ser 
405 410 415 

GTC TCA GTC AGG GGC GTA GAA TCC CCG GCC ATC CAG TCG ACC GAG ACG 

Va" Se- Val Ara Gly Val Glu Ser Pro Ala He Gin Ser Thr Glu Thr 
420 425 430 



72 0 



GTC CTA ACC ATG CTC TCC GAC ATG CTG GCC GCG GTG TGC GAG GAT ACC "6 6 

Va 1 Leu Thr Met Leu Ser Asp Met Leu Ala Ala Val Cys Glu Asp Thr 
245 250 255 



816 



864 



912 



960 



_r TAC GTT GTC AGG GGT GCC AAC CTC GTC ACC GCC GTT AGC TAC GGA 

50- Tvr Val Val Ara Gly Ala Asn Leu Val Thr Ala Val Ser Tyr Gly 
305 ' " 310 315 320 

AGG GCC- ATG AGA AAC TTT GAA CAG TTT ATG GCA CGC ATA GTG GAC CAT 1008 

A-o Ala Me: Arc: Asn Phe Glu Gin Phe Met Ala Arg He Val Asp His 
325 330 335 



1056 



1104 



1152 
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_- r , G ~r AAA AAC AAC GTG CCT CTT TGC TTC GGT TAC CAA AAZ 

TGG o^o G^i A^. a****- ^ Td ,, rvet php nlv tvt Gin Asn 

435 



1% 5S £n ^ Val Pro Leu Cys Phe Gly Tyr «» Asn 



-AA AAA GCG CTC GTA ACC ACG GAG GAT CTA CTG CAC CCA ACC TCT 

lr a 5^ Ala Leu val Thr Thr Glu Asp Leu Leu His Pro Thr Ser 

5 ' 15 520 525 

^ CTC CTC A GA TTG GAG GTC CAC CCC TTC TTT GAT TTT TTT GTG 

u7i lrg Leu Arg Leu Glu Val His Pro Phe Phe Asp Pne Pne Val 

cm 535 540 



„ ^ nCT QGA GCG AGA GGA TCG TAC CGC GCC ACC CAC AGA ACA 

Cys Pro GW Ala Arg Gly Ser Tyr Arg Ala Thr His Arg Tnr 
'il s ' 550 555 

T _ _ - GA AA" ATA CCA CAA CCG CTC GCT CCA AGG GAG TTT CAG GAA 
Val GW tsn lie Pro Gin Pro Leu Ala Pro Arg Glu Phe Gin Giu 
565 570 => /3 

G -~ rAG TTC GAC GCT GTG ACG AAT ATG ACA CAC GTC ATA 

A~ Glv 111 Gin Phe Asp Ala Val Thr Asn Met Thr His Val He 
- 590 



5B0 



_ , gac GTC ATA CAG GAG ACG GCA TTT GAC CCC GCG 

As^ G~n LeC Thr lie Asp Val He Gin Glu Thr Ala Phe Asp Pro Aia 
" ^ " 503 600 605 

_ c _ _ 3 TGZ TAT GTA ATC GAA GCA ATG ATT CAC GGA CAG GAA 

f^u Ph- Cys Tyr Val lie Glu Ala Mer He His Gly Gin .lu 
610 " 515 620 

_ . r — tTG AA" ATG CCC CTC ATT GCC CTG GTC ATT CAA ACC 

s~ s£ v-;i p- l.» n. ^ ^ ^ 

£23 €30 6 3 D 

£ S 8 iK I:; Si; £ ffi S 52 5? iS Si 5S »? 

s s s s i s s s ss $s s K s i ss 



660 665 



675 



— ~~ AAG CTC GCG GGA CAC GAG ACG GTG GGT CGG ACG CCG 
Gin Ala Leu Leu Leu Ala Gly His Glu Thr Val Gly Arg .nr Pro 

- - - 6 9 3 



144C 



G-C CTC AAA AGC ATA TGC CAC CCT CGA ATG CAC AAC CCC ACC CAG TCA 
Ala Leu Lys Ser lie Cys His Pro Arg Met His Asn Pro .nr Gin Se. 

4 5 0 455 
_ „ r r « AAC CAA GCT TTT CCC GAT CCC GAC GGG GGA CAT GGG 

t'l 1?* Leu" £n Gin Ala Phe Pro Asp Pro Asp Glv Gly P,s Gxy 

465 470 4 = 

- - G3- CTC AGG TAT GAG CAG ACG CCA AAC ATG AAC CTA TTC AGA ACG 
Tvr Gly Leu Arg Tyr Glu Gin Thr Pro Asn Me-. Asn Leu Phe Arg -r„ 

4B5 490 

_ „ r~AG TAC ATG GGG AAA AAC GTG GCA TTT GTT CCC GAT GTG 1536 

i£ His S?n Tyr Tyr Met Gly Lys Asn Val Aia Phe Val Pro Asp Vai 
500 503 



1564 



1632 



1680 



1728 



1776 



1B24 



1872 



1920 



1966 



2016 



, r -~ r-Lp, r—- ATC GCC CTT GAG 2 064 

G ^ S StO H?s ?yr £S ^ ?ii - «? Leu XI. Ala L eu Glu 

l a r n oo3 



2112 



690 
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ATC ACA CAT C7G GTT TCG GCT CTC CTC GAC CCG CAT CTG CTG CCT CCC 216: 

He Thr Kis Leu Val Ser Ala Leu Leu Asp Pro Kis Leu Leu Pro Pro 

70S 710 715 720- 

TTT GCC TAC CAC GAT GTC TTT ACG GAT CTT ATG CAG AAG TCA TCC AGA 220 £ 

Phe Ala Tvr His Asp Val Phe Thr Asp Leu Mer Gin Lys Ser Ser Arg 
725 730 735 

CAA CCC ATA ATC AAG ATC GGG GAT CAA AAC TAC GAC AAC CCT CAA AAT 22 56 

Gin Pre lie He Lys He Giv Asp Gin Asr. Tvr Asp Asn Pro Gin Asn 
740 745 750 

AGG GCG ACA TTC ATC AAC CTC AGG GGT CGC ATG GAG GAC CTA GTZ AAT 2 3 04 

Arc Ala Thr Phe He Asn Leu Arg Gly Arg Met Glu Aso Leu Val Asn 
755 760 765 

AAC CTT GTT AAC ATT TAC CAG ACA AGG GTC AAT GAG GAC CAT GAC GAG 2 3 52 

Asn Leu Val Asn He Tyr Gin Thr Arg Val Asn Glu Asp His Asp Glu 
770 775 780 

AGA CAC GTC CTG GAC GTG GCG CCC CTG GAC GAG AAT GAC TAC AAC CCG 24 00 

Arg Kis Val Leu Asp Val Ala Pro Leu Asp Glu Asn Asp Tyr Asn Pro 
785 790 795 e00 

GTC CTC GAG AAG CTA TTC TAC TAT GTT TTA ATG CCG GTG TGC AGT AAC 244 8 

Val Leu Glu Lys Leu Phe Tyr Tyr Val Leu Met Pro Val Cys Ser Asn 
805 810 815 

GGC CAC ATG TGC GGT ATG GGG GTC GAC TAT CAA AAC GTG GCC CTG ACG . 24 96 

Gly Kis Met Cys Gly Met Gly Val Asp Tyr Gin Asn Val Ala Leu Thr 
820 825 830 

CTG ACT TAC AAC GGC CCC GTC TTT GCG GAC GTC GTG AAC GCA CAG GAT 2544 
Leu Thr Tyr Asn Gly Pro Val Phe Ala Asp Val Val Asn Ala Gin Asp 
E35 840 845 

GAT ATT CTA CTG CAC CTG GAG AAC GGA ACC TTG AAG GAC ATT CTG CAG 2 5 92 

Asp He Leu Leu His Leu Glu Asn Gly Thr Leu Lys Asp He Leu Gin 
" E50 855 86 0 

GCA GGC GAC ATA CGC CCG ACG GTG GAC ATG ATC AGG GTG CTG TGC ACC 264 0 

Ala Gly Asp He Arg Pro Thr Val Asp Met He Arg Val Leu Cys Thr 
865 870 875 880 

TCG TTT CTG ACG TGC CCT TTC GTC ACC CAG GCC GCT CGC GTG ATC ACA 26 8 8 

Ser Phe Leu Thr Cys Pro Phe Val Thr Gin Ala Ala Arg Val lie Thr 
885 890 895 

AAG CGG GAC CCG GCC CAG AGT TTT GCC ACG CAC GAA TAC GGG AAG GAT 2 736 

Lys Arg Asp Pro Ala Gin Ser Phe Ala Thr Kis Glu Tyr Gly Lys Asp 
900 905 910 

GTG GCG CAG ACC GTG CTT GTT AAT GGC TTT GGT GCG TTC GCG GTG GCG 2 7 84 

Val Ala Gin Thr Val Leu Val Asn Gly Phe Gly Ala Phe Ala Val Ala 
915 920 925 

GAC CGC TCT CGC GAG GCG GCG GAG ACT ATG TTT TAT CCG GTA CCC TTT 2B32 
Asp Arg Ser Arg Glu Ala Ala Glu Thr Met Phe Tyr Pro Val Pro Phe 
930 935 940 

AAC AAG CTC TAC GCT GAC CCG TTG GTG GCT GCC ACA CTG CAT CCG CTC 2 BSC 

Asn Lys Leu Tyr Ala Asp Pro Leu Val Ala Ala Thr Leu His Pro Leu 
945 950 955 960 

CTG CCA AAC TAT GTC ACC AGG CTC CCC AAC CAG AGA AAC GCG GTG GTC 2 92 8 

Leu Pro Asn Tyr Val Thr Arg Leu Pro Asn Gin Arg Asn Ala Val Val 
965 970 975 
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TTT AAC GTG C"A TCC AAT CTC ATG GCA GAA TAT GAG GAA TGG CAC AAG 

Phe Asn Val Pro Ser Asn Leu Met Ala Giu Tat Glu Glu Trp His Lys 
9B0 9 85 990 

TCG CCC G~C GCG GCG TAT GCC GCG TCT TGT CAG GCC ACC CCG GGC GCC 
Se~ P^o Va^ Ala Ala Tyr Ala Ala Ser Cys Gin Ala Thr Pro Gly Ala 
99 5 1000 1005 

ATT AGC GCC ATG GTG AGC ATG CAC CAA AAA CTA TCT GCC CCC AGT TTC 
He Ala Met: Val Ser Met His Gin Lys Leu Ser Aia Pr~ Se. Pne 

1010 1015 1020 

A-T TGC CAG GCA AAA CAC CGC ATG CAC CCT GGT TTT GCC ATG ACA GTC 
Cvs G" n Ala Lvs His Arg Met His Pro Gly Phe Ala Met Tnr Val 
Ills ' 1030 1035 10.0 

GT~ AGG ACG GAC GAG GTT CTA GCA GAG CAC ATC CTA TAC TGC TCC AGG 
at- aso Glu val Leu Ala Glu His He Leu Tyr Cys Ser Arg 

' * 10 45 1050 1055 

GrQ -r<-Q A CA TCC ATG TTT GTG GGC TTG CCT TCG GTG GTA CGG CGC GAG 
t T. cZ- T-X- cpv M^r Phe Val Glv Leu Pro Ser Val Val Arg Arg G*u 
" 1060 ~ * 1065 1070 

GTA CG- ~CG GAC GCG GTG ACT TTT GAA ATT ACC CAC GAG ATC GCT TCC 
Val A*" a Ser Asp Ala Val Thr Phe Glu lie Thr His Glu He Aia Ser 
" 10 75 1080 1085 

-TG -A'" A~~ GCA CTT GGC TAC TCA TCA GTC ATC GCC CCG GCC CAC GTG 
Z<1 Ala Leu Gly Tyr Ser Ser Val He Ala Pro Ala His Val 

1090 1095 HOO 

G"~ G"~ ATA ACT ACA GAC ATG GGA GTA CAT TGT CAG GAC CTC TTT ATG 
Ala Ala He Thr Thr Asp Met Gly Val His Cys Gin Asp Leu Pne Met 
1105 1H° 1115 1120 

A TC —A GGG GAC GCG TAT CAG GAC CGC CAG CTG CAT GAC TAT ATC 

pKo p-c Glv Asp Ala Tvr Gin Asp Arg Gin Leu His Asp Tyr He 
* * 112 5 " H30 H35 

AAA ATG AAA GCG GGC GTG CAA ACC GGC TCA CCG GGA AAC AGA ATG GAT 
^ Lvs Ala Glv Val Gin Thr Gly Ser Pro Gly Asn Arg Met Asp 

1140 ' H45 1150 

— ~ GGA "A" ACT GCT GGG GTT CCT CGC TGC GAG AAC CTG CCC GGT 

-3-s Va r G*-v Tv~ Thr Ala Gly Val Pro Arg Cys Glu Asn Leu Pro Giy 
^55 * 1160 H65 

— G A~~ CA~ GGT CAG CTG GCA ACC TGC GAG ATA ATT CCC ACG CCG GTC 
Liu Se- H?s Glv Gin Leu Ala Thr Cys Glu He He Pro Thr Pro Val 
1170 ' 1180 

ACA ™CT GAC GTT GCC TAT TTC CAG ACC CCC AGC AAC CCC CGG GGG CGT 
?hr Ser Asd Val Ala Tyr Phe Gin Thr Pro Ser Asn Pro Arg Glv Arg 
1185 " 119 0 115= ^ 200 

. GT~ GTG TCG TGT GAT GCT TAC AGT AAC GAA AGC GCA GAG 

Xla Ala Ser Val Val Ser Cys Asp Ala Tyr Ser Asn Giu Ser Aia Glu 
1205 1210 — 5 

_ T ^ G ~ AC GAC CAT TCA ATA CCA GAC CCC GCG TAC GAA TGC CGG 

Xr^ Leu Phe T^r Asp His Ser He Pro Asp Pro Ala Tyr Glu Cys Arg 
" 1220 1225 i^ju 

t™ A-C AA- AAC CCG TGG GCT TCG CAG CGT GGC TCC CTC GGC GAC GTG 
Tnr £n A^n Pro Trp Ala Ser Gin Arg Gly Ser Leu Gly Asp Val 
12 35 124D I 245 



C97c 



3C24 



3072 



3I6S 



3216 



3264 



3312 



3360 



3408 



3456 



3504 



355: 



360C 



364B 



3696 



3744 
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r~ TA~ AA^ A' rr * A~C TT CGC CAG ACT GCG CTG CCG GGC ATG 7AC AGT 
™~ £n li; Tnr Phe Arg Gin Thr Ala Leu Pro Gly Met Tyr Ser 
1250 1255 ^ 

CCT TG*T CGG CAG TTC TTC CAC AAG GAA GAC ATT ATG CGG TAC AAT AGG 
Cvs Aro Gin Phe Phe Kis Lys Giu Asp -le Me: Arg ^ 
i2£5 " ' 1270 1275 

rSG — G TA<- ACT TTG GTT AAT GAG TAT TCT GCC AGG CTT GCT GGG GCC 
G'v Leu Tyr Thr Leu Val Asn Glu Tyr Ser Ala Arg Leu A.a o ; y A,a 
1285 1290 —5 

C-C GCC ACC AGC ACT ACA GAC CTC CAG TAC GTC GTG GTC AAC GGT AC A 
P-o Ala Thr Ser Thr Thr Asp Leu Gin Tyr Val Val Vai ns: t G_> .... 
1300 1305 1^-C 

G-3 "TT TTG GAC CAG CCT TGC CAT ATG CTG CAG GAG GCC TAT CCC 
As- Va^ ^ Leu Aso Gin Pro Cys His Met Leu Gin Glu Aia Tyr Pro 
:* 3 "l5 " 1320 1325 

A-G C— G"C GCC AGC CAC AGA GTT ATG CTT GCC GAG TAC ATG TCA AAC 
Leu Ala Ala Ser His Arg Val Met Leu Ala Glu Tyr Met Ser Asn 
i3 3C 1335 1340 

AAG CAG ACA CAC GCC CCA GTA CAC ATG GGC CAG TAT CTC ATT GAA GAG 
Lvs G^r Thr His Ala Pro Val His Met Gly Gin Tyr Leu Giu 
1345 ' 1350 1355 1350 

G~G GCG CCG ATG AAG AGA CTA TTA AAG CTC GGA AAC AAG GTG GTG TAT 
Va" Ala Pro Met Lvs Arg Leu Leu Lys Leu Gly Asn Lys Va: Vax Tyr 
1365 1370 -^375 



:2; INFORMATION FOP. SEQ ID NO : 3 : 

{ i ) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 1376 amino acids 
' s ; TYPE: amino acid 
(D) TOPOLOGY: linear 

;:;! MOLECULE TYPE : protein 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Glu Ala Thr Leu Glu Gin Arg Pro Phe Pro Tyr Leu Ala Thr Glu 
5 10 1 = 

Ala Asr. Leu Leu Thr Gin He Lys Glu Ser Ala Ala Asp Gly Leu Phe 
2 C *" ^ 

Lvs Se- Ph- Gin Leu Leu Leu Gly Lys Asp Ala Arc Glu Gly Ser Val 
35 40 45 

^ — P v, 0 Giu Ala Leu Leu Gly Val Tyr Thr Asn Val Val Glu Phe Val 
-= Vo 55 60 

Lvs Phe Leu Glu Thr Ala Leu Ala Ala Ala Cys Val Asn Thr Glu Phe 

€5 70 5 

Lys Asp Leu Arg Arg Met He Asp Gly Lys lie Gin Phe Lys lie Ser 
es 9° - s 

Met Pro Thr He Ala Kis Gly Asp Gly Arg Arg Pro Asn Lys Gin Arg 
ioo 105 — u 



388E 



3 964 



403C 



4080 



4128 



4131 
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G T n ri e val Met Lvs Ala Cvs Asn Lys His His lie Gly Ala Glu 

115 120 1=5 

Ti e Glu Leu Ala Ala Ala Asp He Glu Leu Leu Phe Ala Glu Lys Glu 

" 130 135 140 

— Pro Leu Asp Phe Thr Glu Tyr Ala Gly Ala He Lys Thr He Thr 

H S 150 155 loO 

Se- Ala Leu Gin Phe Gly Met Asp Ala Leu Glu Arg Gly Leu Val Asp 
165 1 ?0 " • 5 

tv.. va" L-u Ala Val Lvs Leu Arg His Ala Pro Pro Val Phe He Leu 
18C I 85 190 

*"S * T, " L»u Glv Asp Pro Val Tyr Ser Glu Arg Gly Leu Lys Lys Ala 

^ Lvs Aso Me: Val Ser Met Phe Lys Ala His Leu He Glu His 

210 " 215 220 

Se- Ph« Phe Leu Asp Lvs Ala Glu Leu Met Thr Arg Gly Lys Gin Tyr 
225 230 235 240 

va^ Leu -hr Met Leu Ser Asp Met Leu Ala Ala Val Cys Glu Asp Thr 
245 250 2^5 

Va * Ph . lvs Glv Val Ser Thr Tyr Thr Thr Ala Ser Gly Gin Gin Val 
' 26*0 265 270 

^la Glv Val Leu Glu Thr Thr Asp Ser Val Met Arg Arg Leu Met Asn 
* 27= 280 285 

Leu L«u Glv Gin Val Glu Ser Ala Met Ser Gly Pro Ala Ala Tyr Ala 
290 ** 295 300 

Se „ y — va' Va- Arc Glv Ala Asn Leu Val Thr Ala Val Ser Tyr Gly 

305 . " 310 315 320 

A-a Ala v>«- Arc Asn Phe Glu Gin Phe Met Ala Arg He Val Asp Kis 
~ 325 330 335 

p— ^-- a Leu ^ro Ser Val Glu Gly Asp Lys Ala Ala Leu Ala Asp 

34C . 345 350 

«is As- Glu He Gin Arg Thr Arg He Ala Ala Ser Leu Val Lys 
3=5 360 3£5 

-,o g' v Aso Lvs Phe Val Ala He Glu Ser Leu Gin Arg Met Tyr Asn 
370 * ' 375 3 6 0 

G'u -r- Gin Phe Pre Cvs Pro Leu Asn Arg Arg He Gin ■ Tyr Thr Tyr 
3i 5 " " 390 395 400 

Phe Ph* Pro Val Glv Leu His Leu Pro Val Pro Arg Tyr Ser Thr Ser 
405 410 4-= 

Va 1 S— va' Arc Glv Val Glu Ser Pro Ala He Gin Ser Thr Glu Thr 
420 * 425 430 

~rp val Val Asn Lys Asn Asn Val Pro Leu Cys Phe Gly Tyr Gin Asn 
435 44C 44:> 

Ala Leu Lvs Ser He Cys His Pro Arg Met His Asn Pro Thr Gin Ser 
450 * 455 460 

Ala Gin Ala Leu Asn Gin Ala Phe Pro Asp Pro Asp Gly Gly His Gly 
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465 



470 475 480 



Tyr Gly Leu Arg Tyr Glu Gin Thr Pro Asn Men Asn Leu Phe Arg Thr 



485 



Phe His Gin Tyr Tyr Met Gly Lys Asn Val Ala Phe Val Pre Asp Val 



500 



505 



Ala G^r. Lvs Ala Leu Val Thr Thr Glu Asp Leu Leu His Pro Thr Ser 

* 3 - 15 520 =>2 = 

A-c Leu Leu Arc Leu Glu Val His Pro Phe Phe Asp Phe Phe Val 
~ 530 ' - 535 540 

»<s t>- 0 cvs Pro Gly Ala Arg Gly Ser Tyr Arg Ala Thr His Arg Thr 
V: 5 " ' 550 555 

Me- Val Glv Asn He Pro Gin Pro Leu Ala Pro Arg Glu Phe Gin Glu 
" e " * 5£5 570 575 

Ser Arg Gly Ala Gin Phe Asp Ala Val Thr Asn Met Thr His Val He 
580 585 

Asp Gin Leu Thr He As? Val He Gin Glu Thr Ala Phe Asp Pro Ala 

595 600 605 

Phe Cys Tyr Val He Glu Ala Met He His Gly Gin Glu 



Tvr Pro Leu - - __ A 

' 610 615 620 

mu Lvs *>h* Val Met Asn Met Pro Leu He Ala Leu Val He Gin Thr 
S25 ' " 630 635 6.0 

Va -. As - Gly Lys Leu Ala Phe Val Asn Ser Tyr His Met 

*-* " " 645 650 655 

va^ A-a He Cvs Thr His He Gly Asn Gly Ser lie Pro Lys Glu 

" 660 ' 665 670 

A 1 a His Glv His Tvr Arg Lys He Leu Gly Glu Leu He Ala Leu Glu 
*~ £7 5 * 680 6 85 

r — J--a L-- Leu Lvs Leu Ala Gly His Glu Thr Val Gly Arg Thr Pro 

' 695 700 



f. a r-i 
C 



v^ s Leu Val Ser Ala Leu Leu Asp Pro His Leu Leu Pre Pro 

710 715 720 

ph. A^a Tvr His Asp Val Phe Thr Asp Leu Met Gin Lys Ser Ser Arg 
725 730 7J - 

r -.- Lys lie Glv Asp Gin Asn Tyr Asp Asn Pre Gin Asn 

~ W "~ T^o" * 745 - 750 

--c A^a — Phe He Asn Leu Arg Gly Arg Met Glu Asp Leu Val Asn 

755 76 ° 765 

Asn L-u Val Asn He Tyr Gin Thr Arg Val Asn Glu Asp His Asp Glu 

770 775 780 

A-c His Val Leu Asp Val Ala Pro Leu Asp Glu Asn Asp Tyr Asn Pro 

7ei 790 795 

val Leu Glu Lys Leu Phe Tyr Tyr Val Leu Met Pro Val Cys Ser Asn 



805 



Glv His Met Cvs Gly Met Gly Val Asp Tyr Gin Asn Val Ala Leu Thr 
820 823 
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Leu Thr Tvr Asn Glv Pro Val Phe Ala Asp Val Val Asr. Ala Gin Asp 
B35 " 840 645 

Asd He Leu Leu His -Leu Glu Asn Gly Thr Leu Lys Asp lie Leu Gin 
* e50 B55 860 

A^a Glv Asp He Arg Pro Thr Val Asp Met He Arg Val Leu Cys Thr 
865 * 87C B-75 BBC 

Se- Ph» Leu Thr Cys Pre Phe Val Thr Gin Ala Ala Arg Val lie Thr 
685 89° 895 

* vs Asd ^ro Ala Gin Ser Phe Ala Thr Kis Glu Tyr Gly Lys Asp 

' 900 905 510 

Va 1 A" 1 a Gin Thr Val Leu Val Asn Gly Phe Gly Ala Phe Ala Val Ala 
c 15 920 525 

Asp Arc Ser Ara Glu Ala Ala Glu Thr Met Phe Tyr Pro Val Pro Phe 
S3 c " 935 940 

Asr Lvs Leu Tvr Ala Asd Pro Leu Val Ala Ala Thr Leu Kis Pro Leu 
545 * " 950 955 960 

Leu o-c Asn Tvr Val Thr Arg Leu Pro Asn Gin Arg Asn Ala Val Val 
965 970 9*75 

Dh~ Asr Val Pro Ser Asn Leu Met Ala Glu Tyr Glu Glu Trp His Lys 
980 985 990 

Ser Pre Val Ala Ala Tyr Ala Ala Ser Cys Gin Ala Thr Pro Gly Ala 
995 1000 1005 

lie Ser Ala Met Val Ser Met His Gin Lys Leu Ser Ala Pro Ser Phe 
101C 1015 1020 

"vs G"r. Ala Lvs His Ara Met His Pro Gly Phe Ala Met Thr Val 

1C25 ' ' 1030 ~ 1035 1040 

Va- A-g Thr Asd Glu Val Leu Ala Glu His lie Leu Tyr Cys Ser Arg 
1045 1050 1055 

A' a Thr Ser Met Phe Val Gly Leu Pro Ser Val Val Arg Arg Glu 

1060 1065 1070 

Va' Ar=r Ser Asp Ala Val Thr Phe Glu He Thr His Glu He Ala Ser 
1C; 75 " 1080 1085 

L«u His Thr Ala Leu Glv Tvr Ser Ser Val He Ala Pro Ala His Val 
1090 * 1095 HOO 

Ala A" a He Thr Asd Met Gly Val His Cvs Gin Asp Leu Phe Met 

1105 ~ " " 1110 H15 H20 

Dhe Pro Gly Asp Ala Tvr Gin Asp Arg Gin Leu His Asp Tyr He 
11*25 * H3C H35 

Lvs Me- Lvs Ala Glv Val Gin Thr Gly Ser Pro Gly Asn Arg Met Asp 
' 1140 * H45 H50 

"^s Val Glv Tyr Thr Ala Gly val Pro Arg Cys Glu Asn Leu Pro Gly 
1155 1160 1165 

Leu Ser His Gly Gin Leu Ala Thr Cys Glu He He Pro Thr Pro Val 
1170 1175 HBO 

Thr Ser Asp Val Ala Tyr Phe Gin Thr Pro Ser Asn Pro Arg Gly Arg 
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1185 1190 1195 1200 

Ala Ala Ser Val Val Ser Cys Asp Ala Tvr Ser Asn Glu Ser Ala Glu 
1205 1210 1215 

Arg Leu Phe Tyr Asp Kis Ser lie Pro Asp Pro Ala Tyr Glu Cys Arg 
122C * 1225 1230 

Ser Thr Asn Asn Pre Trp Ala Ser Gin Arg Gly Ser Leu Gly Asp Val 
1235 1240 1245 

Leu Tvr Asn He Thr Phe Arc Gin Thr Ala Leu Pro Gly Met Tyr Ser 
1250 1255 1260 

Pro Cys Arc Gin Phe Phe His Lys Glu Asp He Met Arg Tyr Asn Arg 
1265 * ' 1270 1275 ~ " 1280 

Giv Leu Tvr Thr Leu Val Asn Glu Tyr Ser Ala Arg Leu Ala Gly Ala 
1285 1290 1295 

Pro Ala Thr Ser Thr Thr Asp Leu Gin Tyr Val Val Val Asn Glv Thr 
1300 1305 1310 

Asp Val Phe Leu Asp Gin Pro Cys His Met Leu Gin Glu Ala Tvr Pro 
1315 1320 1325 

Thr Leu Ala Ala Ser His Arg Val Met Leu Ala Glu Tvr Met Ser Asn 
1330 1335 1340 

Lys Gin Thr His Ala Pro Val His Met Gly Gin Tyr Leu He Glu Giu 
1345 1350 1355 1360 

Val Ala Pro Met Lys Arg Leu Leu Lys Leu Gly Asn Lvs Val Val Tvr 
1365 1370 * 1375 



INFORMATION FOR SEQ ID NO : 4 : 

£ i ) S £ QUEN CE CHARACTER I S T I CS : 

(A) LENGTH: 1143 base pairs 
(E) TYPE : "nucleic acid 

(C) STRANDEDNESS ; single 
CD) TOPOLOGY: linear 

iii) MOLECULE TYPE : DNA (genomic) 

(ill) HYPOTHETICAL: N 

!:v) ANTI -SENSE: N 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1143 

(D) CTKER INFORMATION: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

AGC ATT CGG GGA CAG ACC TTT AAC CTG CTC TAC GTA GAC GAG GC3 AAT 4 8 

Ser lie Arg Giv Gin Thr Phe Asn Leu Leu Tyr Val Asp Glu Ala Asn 

1 "5 10 15 

TTT ATT AAA AAG GAT GCA CTG CCG GCT ATT CTG GGT TTC ATG CTT CAG 96 

Phe He Lvs Lys Asp Ala Leu Pro Ala He Leu Gly Phe Met Leu Gin 

20 25 30 
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AAA GAC GCC AAG CTT ATA TTT ATA TCA TCC GTG AAC TCG TCA GAC CGC 
Ala Lys Leu lie Phe lie Ser Ser Val Asn Ser Ser Asp Arg 
2 5 ^ 0 * 



, „~ AG - * — r*TQ CTT AAC CTC AGG AAC GCC CAG GAA AAC 
ci; Se - Phi Leu Leu Asn Leu Arc Asn Ala Gin Glu Lys Me: Leu 

" "5 0 55 60 



AT GTG GTC AGT TAC GTG TGT GCG GAC CAC CGA GAA GAT TTC CAC CTG 

,sn 
£5 



Itn Va" Val Ser Tvr Val Cys Ala Asp His Arg Glu Asp Phe Kis Leu 

70 75 60 



"AA G"* G~A C~A GTG TCC TGT CCT TGT TAC AGA CTG CAC ATT CCG ACG 

G'n Asp Ala Leu Val Ser Cys Pro Cys Tyr Arg Leu Kis He Pro Tnr 

65 90 55 

-A- A— ACC ATC GAC GAA TCC ATC AAA ACC ACC ACC AAC CTC TTT ATG 

~.Z -*« '- e Asp Glu Ser He Lys Thr Thr Thr Asn Leu Pne Met 

"" 100 * 105 HO 

GAG GGG GCA TTC GAC ACC GAA CTA ATG GGC GAG GGA GCA GCG TCG TCA 

G^u Cv Ala Phe Asd Thr Glu Leu Met Gly Glu Gly Ala A^a Ser Ser 

X1 c ' 120 125 

q<~t A ~ G ■ TAC CGC GTG GTG GGT GAC GCA GCG CTG ACA CAG TTT 

Asn Ala -'-r Leu Tvr Arg Val Val Gly Asp Ala Ala Leu Thr Gin Phe 
130 ' 135 140 

GAC A^G TGT CGG GTA GAC ACC ACC GCC CAG GAG GTT CAG AAG TGC CTT 

As- Me- Cvs Ara Val Asp Thr Thr Ala Gin Glu Val Gin Lys Cys Leu 
145 ' 150 155 160 

GGA AAA CAG CTG TTT GTT TAC ATC GAC CCC GCG TAT ACG AAC AAC ACG 

G*v Lvs G^n Leu Phe Val Tyr He Asp Pro Ala Tyr Thr Asn Asn ,hr 
" 165 170 175 

GAG G-G T CC GGT ACT GGC GTG GGC GCC GTT GTC ACG AGT ACT CAG ACT 

G -... A r 5 se- Glv Thr Glv Val Gly Ala Val Val Thr Ser Thr Gin Tnr 
180 * 1B5 190 



173 



144 



191 



24 0 



2 S £ 



384 



432 



480 



528 



576 



A" AGA AGC CTC ATA TTG GGC ATG GAG CAT TTC TTC CTG CGC GAC 6 24 

pZZ t--- ser Leu He Leu Gly Met Glu His Phe Phe Leu Arg Asp 

200 205 



_ GC ^ GCT G CT TAC GAG ATA GCG TCC TGC GCA TGC ACC- ATG 6/2 

L-u Thr Glv Ala Ala Ala Tyr Glu lie Ala Ser Cys Ala Cys Thr Met 

~ Z1C ' 215 220 

A— AAG G"C- ATC GCT GTG CTC CAC ACC ACA ATT GAG CGC GTG AAC GCG 72 0 

- : e Lvs Ala He Ala Val Leu His Thr Thr He Glu Arg Val Asn Ala 

230 235 240 



766 



GC3 GTC GAA GGC AAC AGC AGC CAA GAT TCT GGo G,» 

A" a Val Glu Gly Asn Ser Ser Gin Asp Ser Gly Val Ala He Ala ,hr 
245 250 255 

GTC CTT AAC GAA ATA TGC CCG CTC CCC ATA CAT TTT CTA CAC TAT ACT 616 
Val Leu Asn Glu He Cys Pro Leu Pro He Kis Pne Leu His Ty . .fc- 
260 26S 270 

— AAG AG" AG" GCC CTG CAG TGG CCA ATT TAC ATG TTG GGA GGC GAG 
I2i £vs sir Ser Ala Leu Gin Trp Pro He Tyr Met Leu Gly Giy Gxu 
^ ' 275 280 

AAA TCC TCC GCG TTT GAG ACA TTC ATC TAC GCT CTG AAC TCC GGC ACC 
tvs Ser Ser Ala Phe Glu Thr Phe He Tyr Ala Leu Asn Ser Gxv Thr 
- 290 295 300 



864 



912 
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CTG AGC GCC AGC CAG ACG GTG GTG TCC AAC ACC ATC AAA ATA TCA TTT 96 0 

Leu Ser Ala Ser Gin Thr Val Val Ser Asn Thr lie Lys lie Ser Phe 
305 310 315 32C 

GAC CCG GTG ACC TAC CTG GTA GAA CAG GTC CGC GCG ATC AAG TGC GTC 10 0E 

Asp Pre Val Thr Tyr Leu Val Glu Gin Val Arg Ala lie Lys Cys Val 
325 330 335 

CCG CTT AGG GAT GGA GGG CAG TCA TAC AGC GCC AAG CAA AAG CAC ATG 105 6 

Pro Leu Arg Asp Gly Gly Gin Ser Tyr Ser Ala Lys Gin Lys His Me: 
340 345 350 

TCG GAC GAC TTA CTT GTG GCA GTT GTC ATG GCC CAT TTT ATG GCT ACC 1104 
Ser Asp Asd Leu Leu Val Ala Val Val Met Ala His Phe Met Ala Thr 
255 360 365 

GAT GAT AGA CAC ATG TAC AAG CCC ATA TCC CCA CAA TAA II 4 3 

Asd Asp Arc: His Met Tyr Lvs Pro lie Ser Pro Gin 
370 " 375 380 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 ammo acids 

(B) TYPE: amino acid 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Ser lie Arc Glv Gin Thr Phe Asn Leu Leu Tyr Val Asp Glu Ala Asn 
1 ' * 5 10 15 

Phe lie Lvs Lvs Asp Ala Leu Pro Ala He Leu Gly Phe Met Leu Gin 
20 25 3C 

Lvs Asp Ala Lvs Leu He Phe He Ser Ser Val Asn Ser Ser Asp Arg 
35 * 40 45 

Ser Thr Ser Phe Leu Leu Asn Leu Arg Asn Ala Gin Glu Lys Met Leu 
50 55 60 

Asn Val Val Ser Tvr Val Cvs Ala Asp His Arg Glu Asp Phe His Leu 
65 70 75 80 

Gin Asp Ala Leu Val Ser Cys Pro Cys Tyr Arg Leu His He Pro Thr 
8 5 * 90 95 

Tvr He Thr He Asp Glu Ser He Lys Thr Thr Thr Asn Leu Phe Met 

100 105 110 

Glu Glv Ala Phe Asp Thr Glu Leu Met Gly Glu Gly Ala Ala Ser Ser 
115 120 125 

Asn Ala Thr Leu Tyr Arg Val Val Gly Asp Ala Ala Leu Thr Gin Phe 
130 135 140 

Asn Met Cys Arg Val Asp Thr Thr Ala Gin Glu Val Gin Lys Cys Leu 
145 ISO 155 160 

Gly Lys Gin Leu Phe Val Tyr He Asp Pro Ala Tyr Thr Asn Asn Thr 
165 170 175 



Glu 



Ala Ser Gly Thr Gly Val Gly Ala Val Val Thr Ser Thr Gin Thr 
180 185 190 
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P-s Th- A-c se- Leu He Leu Giy Met Glu His Phe Phe Leu Arg As? 
195 200 205 

Leu Glv Ala Ala Ala Tyr Glu He Ala Ser Cys Ala Cys Thr Me: 

210 ' 215 220 

-vs Ala He Ala Val Leu His Thr Thr He Glu Arc val Asn Ala 

' 230 235 

Ala Va 1 Glu Glv Asn Ser Ser Gin Asp Ser Gly Val Ala He Ala Thr 
- 245 250 2=s 

As- G~ * ' "le Cys Pro Leu Pro He His Phe Leu His Tyr Thr 
va- n_ - ~s.. 2g5 270 

As- tvs • Ser Ala Leu Glr. Trp Pro He Tyr Met Leu Gly Gly Glu 
~ ^' Z ~S 280 2E5 

Lvs Se- Ser Ala Phe Glu Thr Phe He Tyr Ala Leu Asn Ser Gly Thr 
290 295 300 

L«u S— Ala Ser Gin Thr Val Val Ser Asn Thr lie Lys He Ser Phe 
3C5 ' 310 315 ^-0 

Aso ?~o Val Thr Tvr Leu Val Glu Gin Val Arc Ala He Lys Cys Val 
^ 32 5 330 33= 

P-o L« • A-= Asd Glv Glv Gin Ser Tyr Ser Ala Lys Gin Lys His Met 
" 340 ' 345 350 

Se- Asp Asc Leu Leu Val Ala Val Val Met Ala His Phe Met Ala Thr 
3 55 36 0 36 5 

As ~ H i s Met Tvr Lvs Pro He Ser Pro Gin 

370 " " 375 380 



(2! INFORMATION FDR SEQ ID NO : 6 : 

( i » SZO'SEKZZ CHAEACTEF.ISTICS : 

(A? LENGTH: 234 base pairs 
(B- TYPE: nucleic acid 
(C; STRANDEDNESS : single 
(D) TOPOLOGY: linear 

iii; MOLECULE TYPE: DNA (genomic) 

HYPCTHETI CAL : N 

liv! ANTI- SENSE: N 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .234 
(D) OTHER INFORMATION : 



(Xi) SEQUENCE DESCRIPTION: SEO ID NO : 6 : 



A~G GAG CCA GTG GAT CCT GGA CAT GTG GTG AAT GAG AAA GAT TT. 

JUt Glv GlZ Pro val Asp Pro Gly His Val Val Asn G.u Lys Asp Pne 

! ' 5 10 ^ 

_ „„„ , - r. «-rr ar;T* rtA CCC CTT AGG GAG CAA GTG GTC 

SS ™i gTu g£ Phe Phe ^ So L.u Arg Glu Gin val V.i 

20 25 



96 
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GCG GGG GTC AGG GCA CTC GAC GGC CTC GGT CTC GCT GAC TCT CTA TGT 

Ala Gly val Arg Ala Leu Asp Gly Leu Gly Leu Ala Asp Ser Leu Cys 
25 4 0 * 5 

r ,n ir- AC 6 GAA AGA CTC TGC CTG CTG ATG GAC CTG GTG GGC ACS GAG 

His Thr Glu £g Leu Cys Leu Leu Met Asp Leu Val Gly Tr.r G,u 

-- 55 60 



5C 

t G ~ TTT GCG AGG GTG TGC CGC CTA GAC ACC GGT GCG AAA TGA 

Cvs Phe Ala Arg Val Cys Arg Leu Asp Thr Gly Ala Lys . 
£5 70 7 * 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gly Glu Pro Val Asp Pro Gly His Val Val Asn Glu Lys Asp Phe 



5 



Glu Glu Cys Glu Gin Phe Phe Ser Gin Pro Leu Arg Glu Gin Val Val 
20 25 30 

Ala GH' Val Arg Ala Leu Asp Gly Leu Gly Leu Ala Asp Ser Leu Cys 
35 40 45 

v*s Lvs Thr Glu Arg Leu Cys Leu Leu Met Asp Leu Val Gly Thr Glu 
50 55 60 

Cvs Phe Ala Arg Val Cys Arg Leu Asp Thr Gly Ala Lys 

7 0 7 5 

CD ' U 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 5 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
tiii) HYPOTHETICAL: N 
i;v) ANTI- SENSE: N 

(IX) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 5B5 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



7T 



ATG AAG AGT GTG GCG AGT CCC TTA TGT CAG TTC CAC GGC GTG 
£e? £y"s Ser Val Ala Ser Pro Leu Cys Gin Phe Hi. Gly Val Pne Cys 

1 5 10 

CTG TAC CAG TGT CGC CAG TGC CTG GCA TAC CAC GTG TGT GAT GGG GGC 



144 



- c- 



23- 



48 



96 



I 
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Leu Tyr Gin Cvs Arg Gin Cys Leu Ala Tyr His Val Cys Asp Gly Gly 
20 25 30 

GCC GAA TGC GTT CTC CTG CAT ACG CCG GAG AGC GTC ATC TGC GAA CTA 14 4 

Ala Glu Cys Val Leu Leu His Thr Pro Glu Ser Val lie Cys Glu Leu 
35 40 45 

ACG GGT AAC TGC ATG CTC GGC AAC ATT CAA GAG GGC CAG TTT TTA GGG 192 

Tr~ Cv Asn Cys Met Leu Gly Asn lie Gin Glu Gly Gin Phe Leu Gly 

' *" ib 55 60 

CCG GTA CCG TAT CGG ACT TTG GAT AAC CAG GTT GAC AGG GAC GCA TAT 24 C 

Pro Val Pro Tvr Arg Thr Leu Asp Asn Gin Val Asp Arg Asp Ala Tyr 

65 '70 75 80 

CAC GGG ATG CTA GCG TGT CTG AAA CGG GAC ATT GTG CGG TAT TTG CAG 2 8 8 

His Glv Me: Leu Ala Cys Leu Lys Arg Asp lie Val Arg Tyr Leu Gin 

85 " 90 95 

ACA TGG CCG GAC ACC ACC GTA ATC GTG CAG GAA ATA GCC CTG GGG GAC 336 
Thr Trr> Pro Asd Thr Thr Val lie Val Gin Glu lie Ala Leu Gly Asp 
100 105 110 

GGC GTC ACC GAC ACC ATC TCG GCC ATT ATA GAT GAA ACA TTC GGT GAG 3 84 

Glv Val Thr Asp Thr lie Ser Ala lie lie Asp Glu Thr Phe Gly Glu 
115 120 125 

TGT CTT CCC GTA CTG GGG GAG GCC CAA GGC GGG TAC GCC CTG GTC TGT 43 2 

Cys Leu Pro Val Leu Gly Glu Ala Gin Gly Gly Tyr Ala Leu Val Cys 
130 135 " 140 

AGC ATG TAT CTG CAC GTT ATC GTC TCC ATC TAT TCG ACA AAA ACG GTG 460 
Ser Met Tvr Leu His Val He Val Ser He Tyr Ser Thr Lys Thr Val 
145 " ISO 155 16 0 

TAC AAC ACT ATG CTA TTT AAA TGC ACA AAG AAT AAA AAG TAC GAC TGC 52 8 

^vr Asn Ser Met Leu Phe Lvs Cys Thr Lys Asn Lys Lys Tyr Asp Cys 
1€5 170 175 

ATT GCC AAG CGG GTG CGG ACA AAA TGG ATG CGC ATG CTA TCA ACG AAA 576 
He Ala Lys Arg Val Arg Thr Lys Trp Met Arg Met Leu Ser Thr Lys 



180 185 190 

GAT ACG TAG 
Asp Thr 

195 

(2) INFORMATION FOR SEQ ID NO : S : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 194 amino acids 

(B) TYPE: ammo acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(XI ) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Lvs Ser Val Ala Ser Pro Leu Cvs Gin Phe His Gly Val Phe Cys 
1 ' 5 10 15 

Leu Tvr Gin Cys Arg Gin Cys Leu Ala Tyr His Val Cys Asp Gly Gly 
20 25 30 

Ala Glu Cvs Val Leu Leu His Thr Pro Glu Ser Val He Cys Glu Leu 
3 5 4 0 4 5 



585 
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G^v Asr. Cys Met Leu Glv Asn He Gin Glu Gly Gin Phe Leu Gly 
lb 55 60 

P-o Va" P-c T\~ Arg Thr Leu Asp Asn Gin Val Asp Arc As? Ala Tyr 
is 70 75 e: 

Glv Met Leu Ala Cys Leu Lys Arg Asp He Val Arg Tyr Leu Gin 
95 90 95 

Tro Pro Asn Thr Thr Val lie Val Gin Glu lie Ala Leu Gly Asp 



100 



105 



Glv Va" Asc Thr He Ser Ala He He Asp Giu Thr Phe Gly Giu 

115 ' 120 125 

-•s L«u Val Leu Gly Glu Ala Gin Gly Gly Tyr Ala Leu Val Cys 

130 125 140 

Ser Me: Tyr Leu His Val lie Val Ser He Tyr Ser Thr Lys Thr Val 

**v- As- S~r Met Leu Phe Lvs Cys Thr Lys Asn Lys Lys Tyr Asp Cys 
165 * 170 175 

•i e Ala Lvs Arg Val Arg Thr Lys Trp Met Arg Met Leu Ser Thr Lys 
* iBO 185 190 

Asp Thr 



INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY: linear 



in > 



MOLECULE TYPE: DNA (genomic) 

i i i i > HY P OTHET I CAL : N 

;:v! Aim -SENSE: N 

iiXi FEATURE: 

(A) NAME / KEY : CDS 

( H ) LOCATION : 1 . - 93 9 

(D) OTHER INFORMATION: 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

A^G G-T AGC CGG AGG CGC AAA CTT CGG AAT TTC CTA AAC AAG GAA TGC 4B 

"i- Ala Ser Arg Ara Arg Lys Leu Arg Asn Phe Leu Asr. ^ys Glu .ys 

' "i 5 10 -5 

t-TZ. t GG A"- GTT AAC CCA ATG TCA GGG GAC CAT ATC AAG GTC TTT AAC 96 

He ^ Val Asn Pro Met Ser Gly Asp His He Lys Va: Phe Asn 

* " 20 =5 30 

_ — ATC TCG CCG GTG TAT GAC CCT GAG CTG GTA ACC AGC 144 

Xla Cvs Th- Ser He Ser Pro Val Tyr Asp Pro Giu Leu Val Thr Ser 
35 40 45 

— - r-A C-G AGC GTG CCT GCT TAC AAT GTG TCT GTG GCT ATC TTG CTG 192 
Ty- A^a Leu Ser vll Pro Ala Tyr Asn Val Ser Val Ala He Leu Leu 
50 55 60 
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2 04 




G * r-. ■ - \-3J~- — 

Val Pre Asp 
13 C 



Leu Tr.r Arc Glr. G:u 
12 5 



Gnn _ \J - - — -~- - -J-i- — , w , * T 

Leu Val K.s Tr.r Ser Glr. Va. .a. 



=!£ =5= ?fv SI lIu ?II SI lys Ti-.r Me-. Giy Leu Glu ?n« Arc Asr. 



15: 



16 C 



. G7T TGG GTC GGG GGC GGA TCG G.o --^ - - - - - 

^. 5« Pii val Trp Leu Giy Giy Giy Ser V*i Tr= Leu ^u P ne 



GAI TAC ATG GGG TTC TGT CCG GGT GTC GAG Go- ^ 
As- Tyr Me: Ala Phe Cys Pro Giy Va. Asp ^-y Me- f.c 



Leu Giy Val ? ^ - - - - , g 



ib; 



165 

t — • G - 3 GCC GGG CTG CTT ACC AGG TGI GAG CAI ~ -~ — - 

:;r: 1-2 £~ Va! Xll Ala Leu Leu Thr Arg Cys Asp K.s Pre Asp 

= " I- 200 205 



'AC TGT CAT GGA GTC CGT GGA GAG GTT AAT GTA 
Glv Leu Ar= Glv Kis Val Asr. Va. 
**" "" " -15 HO 



- -'T GGT GGG 
Phe Ar= G 1 y 



Tvr Cvs sir Ala Glr* Se'r Pre Giy Leu Ser Asr. lie Cys Pre Cys lie 



Lvs Ser _ys 



OJO 

Jl -» D 



A-.. - - - 

Asr. Phe Leu Giy Leu 



AAT GGA GTG ACT AGG GTC ACT GG- 

Asr. Glv Val Tr.r Are Val Tr.r Giy Asr. Ar= 
25: 

TTI GAT CC1 ATT GTC GAG AGC AG G GTA AC~ 

Pr.e Asp Pre lie Val Glr. Ser Arg Va: Tr.r 

2 c 5 ■ - ' " 



^_a i-s- 



A3C caC CCA ACC GCC ACG CAC GTC « - f 

— : K1S Pre Tr.r Pre Tr.r His Val Glu Asr. Va. 



CTA AC A GGA GTG CT. , p ^ Q£> : Gl - Gi , 

--.^ ... val Leu Ase Asp i-e- ««- 



3 0 5 



Leu Glv Pre Leu .r.r 
310 



_ . . 

r.r Asr. Val 



57£ 



524 



76 = 



Elf 



6 6 -i 



INFORMATION "Or. SZw I- K-:-- 
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ii) SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 3 II amine acids 
(3; TYPE: aT.in: arid 
ID! TOPOLOGY: linear 

;ii- MOLECULE TYPE: proceir. 

ixi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Met Ala Ser Arg Arg Arc Lys Leu Arg Asr. Phe Leu Asr. Lvs Glu Cvs 
- 5 1C 15 

He Tre Thr Val Asr. Pre; Me: Ser Glv Asr His lie Lvs Val Phe Asr. 
2Z 25 * 3C 

Ala Cvs Thr Ser lie Ser Pro Val Tvr Aso Pre Glu Leu Val Thr Ser 
35 40 ' 45 

Tyr Ala Leu Ser Val Pro Ala Tyr Asr. Val Ser Val Ala lie Leu Leu 
50 55 60 

His Lys Val Me: Gly Pro Cys Val Ala Val Glv lie Asr. Glv Glu Met 
£= . 7c 7= " e: 

lie Me: Tyr Val Val Ser Gin Cys Val Ser Val Arc Pre Val Pro Glv 
e5 90 95 

Arg Asp Gly Me: Ala Leu lie Tyr Phe Gly Gin Phe Leu Glu Glu Ala 
100 105 no 

Ser Gly Leu Arg Phe Pro Tyr lie Ala Pro Pro Pro Ser Arc Glu His 
11= 120 125 

Val Pro Asp Leu Thr Arg Gin Glu Leu Val His Thr Ser Gir. Va~ Va* 
13C 135 140 

Arg Arg Gly Asp Leu Thr Asr. Cys Thr Me: Gly Leu Giu Phe Arc Asr. 
= 150 1 = 5 " 160 

Val Asr. Pre Phe Val Trp Leu Giy Gly Glv Ser Val Trr Leu Leu Phe 
i£5 170 * 175 

Leu Gly Val Asp Tyr Me: Ala Phe Cvs Pro Glv Val Asr Glv Me: Pro 

ie: i'b5 ' * 19; 

Ser Leu Ala Arg Val Ala Ala Leu Leu Thr Arc Cvs Aso His Pro Aso 
1=5 2 0C 2 05 

Cys Val His Cys His Giy Leu Arg Gly His Val Asr. Val Pr.e Arc Glv 
21 : 215 22: 

Tyr Cys Ser Ala Glr. Ser Pre Glv Leu Ser Asr. lie Cvs Pro Cvs lie 
225 230 235 ' 24C 

Lys Ser Cys Gly Thr Giy Asr. Gly Val Thr Arg Val Thr Glv Asr. Arc 
245 25C " 255 

Asr. Phe Leu Giy Leu Leu Phe Asp Pro lie Val Gir. Ser Arc Val Thr 
26 0 265 270 

Ala Leu Lvs lie Thr Ser His Pro Thr Pro Thr His Val Glu Asr. Val 
275 28C 255 

Leu Thr Glv Val Leu Aso Aso Glv Thr Leu Val Pro Ser Val Gir. Glv 
290 295 300 

Thr Leu Giy Pro Leu Thr Asr. Val 
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2Z-z 3-C 

: - i::f d?_mat i on for sz; z d no : 1 2 : 

• i ££«uz::rz characteristics: 
(A; LENGTH: Be base pairs 
(S'= TYPE: r.u:is:: acid 
Ci S TRANDEDNZ S S : single 
(D: TOPOLOGY: linear 

KGLECL"LE TYPE : DIC A 1 genox:: > 
i v Aim - sense : :: 

; IX FEATVRE : 

I A NAKE/rlZY: CDS 
■;r: LOCATION: 1 . . 8£ 

:c. other information: 



: xi 



SEGUE NCE DESCRIPTION: SEC ID KC : 1. 



ATG GAC TCA ACT AAC TCT AAA AGA GAG TTT ATT AAG TCG GCT CTG GAG 4 5 

Asc Ser Tr.r Asr. Ser Lvs Arc Glu Phe lie Lvs Ser Ala Leu Glu 

L 5 1C 15 

GCC AA0 ATI AA0 AGG AGG GCA GCT GTA TCG CTA TTT GA 8c 

Ala As- lie Asr. Ar= Ara Ala Ala Val Ser Leu Pr.e 

20 25 



:i INFORKATION FOR SEC ID NO: 13: 

.1. SECUZNCE CHARACTERISTICS: 

(A LENGTH: 22 amino acids 
iE; TYPE: air.ir.o acid 
iZ TCP CLOG Y : linear 

,ii r>*GLZ OYLZ TYPE: protein 

:>:i £ZZ}Z7ZKZZ DESCRIPTION: SEC. ID NO: 13: 

mc- jl s - e = - Tr.r Asr. Ser Lvs Arg Glu Pr.e He Lvs Ser Ala Leu Gl; 

5 i: 15 

-la Asr. He Asr. Arc Arg Aia Ala Val Ser Leu Pne 
2: 2 = 



INFORMATION FOR SZZ IT NO: 14: 

; 1 S E DUZN CE CHARACTER. I S T I CS : 

;A' LENGTH: 1^43 base pairs 
•3- TYPE: nuclei c acid 
; Zt S TRANDZDNZS S : single 
iD) TOPOLOGY: linear 

(ii; MOLECULE TYPE: DNA t genomic; 
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r;A~ GAA "~CA AG- A — ^- - . 

Asr Glu Ser Ser Thr Ser Thr Thr Ser Thr As? Me- Asp Asp ^eu Pre 



195 



GOO 205 



T~ G" ATA GG3 CM. ~r ATT AAA TCA TGG TTG 0=3 ATC Kl GAA T 

ser Aia lie Gly GIr. Asr. :ie Lys Ser Tr? Leu G,y iA e Se. 

21C CIS ^20 

TCG GCG ACT GCT GTG CCG GTG ACC ACG CAG CTT ATG GTA CC3 GTG CAG 



240 



iiv anti- sskse: i; 

i FEATURE : 

(A; NAME /KEY: CDS 
tE ■ LOCATION: 1 . .1743 
(C: CTHEr. INrOP-MATION: 

ixi SEQUENCE DESCRIPTION: SZQ ID NO: 14: 

*— C" GAA GGT TTT GGA GCG GAC TCG GTG GGG CGC GGC GGA GAA 

^:;Z G r v Glv Phe Giv Ala Asp Ser Val Gly Arg Gly G_y G-u 

*.C7 AGG GGA GGC AGG TGG GAG TTG GGC- ACT TCG GAC 

~s Val Thr Arc Glv Gly Arg Tr? Asp Leu Gly Ser Ser Asp 

~- " ' 25 31- 

GAC GAA TCA AGG ACC TCG ACA AC 3 AG 3 ACG GAT ATG GAG GAG CTG CCT 
ier 
25 

GA- A~~ AAA CCA CTA AGG GGA AAG TCT GTA AAA A3C TGG TAG ATA 

Glu Gil Ari Lys Pro Leu Tr.r Gly Lys Ser Val Lys Tr.r Ser Tyr G.e 
5 Z - ^ ^ ^ 

TA" GAG GT3 CCG ACG GTC CCG AGG AGC AAG CCG TGG CAT TTA ATG CAG 
TV- As- Va' Pre Thr Val Pre Thr Ser Lys Pro Trp Kis Leu Me- His 
£5 7 0 75 80 

— AA ' TAG GGA ACG CCT AGG TTT CCG CCG AGA CCT CTG ATA 286 

As-. As-. Ser Leu Tvr Ala Thr Pro Arg Phe Pro Pro Arg Pro Leu ^±e 

SS 90 5j 

-G~ CA~ CCT TCG GAA AAA GGC AGG ATT TTT GCG AGT CGG TTG TGA GCG 
v-s £er Glu Lvs Glv Ser He Phe Ala Ser Arg Leu Ser 

r 10 G " 105 HO 

A" GA" GAG GAG TCG GGA GAG TAG GCG CCA ATG GAT CGC TTG GCG TTG 
-« Asr Asr Asr Ser Gly Asp Tyr Ala Pro Met As? Arg Phe A.a Pne 
" --'z 12C 1- = 

-A3 AGG GCG A33 GTS TGT GGT CGG CCT CCG CTT CGG CCT CCA AAT CAG 
~ c e - Ar= Val Cvs Gly Arg Pro Pro Leu Pro Pro Pro Asr. r.is 

„ — - Ar - AG3 C C3 GGA GAG GCG TGA ATG GGG GAG GTG GGC 

Z,Z" A~a ~" Ar= Pro Ala Aso Ala Ser Me. Gly Asp Val Gly 

" 15 o 155 i°o 

-3- --3 GAT GTG GAG GGA CTG AAG AG 3 AGG CCA AAG- GGA TTT TTA AAA. 
rr; ^ Leu Gln Gly Leu Lys Arg Thr Pre Lys Gly Phe Leu -ys 
lc5 l^O 

"A TCT AGG AAG GGG GGG AGT CTG AAA GGG CGT GGA CGG GAT GTA GGT 
s G-v Glv Ser Leu Lvs Ala Arg G.y Arg «sp G... 
TSO " " ^5 150 

, . G - - G ^ 0 GGC TTT AGT CCT AG 3 GGG GTG AAA 

12; Zr= ^; ^ s .r ,r= Ar f G i y v ai ^ 



336 



3B4 



•SBC 



671 
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ser Ala Tr.r Ala Val Pre Val Thr Thr =1- Me, Val Pro Val His 

22 = ~ jC 

, AGA A 1 "- C-T GTC- AC2 GTG GAC TAG AGG AAT GTT TAT TT2 CTT 

Leu Ar=" ^ Pro Val Thr Val Asp Tyr Arc Asn Va_ - 

245 - = * 

--• ATG GST GTG G3T AAA TTA A" TT3 GTT AAT 3" 

^ G~ val Gly val Gly Lys Ser T.-.r Leu Va- Asr. ^« 

-■ 26C 265 

. c- —Z CAG GAG A3A 3T3 ATA AGT TTT "T GAG r~ 

Va! =vi V.y lie Liu P~ Gin Giu Arc Val Thr Ser Phe Pre G-u ,r= 

r 28 J -w_ 

a""" AGG GCA TTT ACA GAT 737 7A2 AAG GAA ATT 7CC 

-V ^: 7~ ~~ Aia p« ^ asp = ys t> t l v . c-_u ... 
;;: - 2=5 

, — G -- AAG GCG GGA GAT CC3 CTG A2G T2T GTC AAA 

£S Leu JUS ~I Ser = Iy I*. Ala Gly Asp Pro ,eu T,r Ser A.a £, 

„ , ~- — T- --3 CCC TT2 CGG AGG AA2 GCC 

Tyr J£ Cys S£ ^ ~s Phe Ser £ u Pro Pne Ar, T,r A« Al. 



325 



- CTG c « ATG ATG CAG CC2 TGG AAC GTT GGG GGT GGG TTT 

~?.r Ilk ill Leu Arc Me, Me, Gin Pro Trp Asn Va. G,y ..y «... Se. 

340 34D 

. A""~ CAT TGG TGC GTT TTT GAT AGG CAT CTC CTC TC2 CCA 

Ity Ir= Gly ??i Sts Trp Cys Val Phe Asp Arc His Leu Leu Ser Pre 

c — CTC ATG CAT CTG AAG CAT GG2 CG3 TTA 7CT TTT 

1-2 vi ? vil ikl Pro Leu Met H^s Leu Lys H:s Gly Arc 

111 375 380 

T CAA TTA TTT TCC ATT TTT AGA GTC ATA 



350 355 



Asr His Pr.e P 
255 

- — - TC A CT C72 TCC AG 2 GCC GAG 

'f.:'f"f.'~ J['l £eu Thr Leu Ser Ser Ala Giu 

' " " 42-3 4L0 



G - G TGG GTT TAT CAC GC2 GTG TAC T"T TCA 

~~ "J. ~ a -~ Ala Tvr His Ala Val Tyr Cys Ser 

:!= ' - c 445 

^ — GTG GAG CA3 ATG GTA CAA CTA ZZZ 

I"" I -I- va" Giu Glr. Me- Val G.r. Leu Cys 

-8- T -- r 4t: 46: 

45: 

» — TG" TT2 CG2 AG 2 GT3 CGC CTZ- 

^1 Pr= Gi^ Cys P,e Ar, Ser Val Ar, Leu 

;y; 470 

AAA AA" CTT CAC GAG CAG AGC AT 3 CTA CCT ATG ATC 

GA2- GAA A_. --o ^ r -. .. m»> L»u Pro Me, lie 

Giu Giu Tr.r Leu Lys Asn Leu His G— o Se. M_ 
4E5 



S6- 



lOGc 



105c 



1104 



Leu 


Ser 


Phe 




GAA 

Glu 


GG2 

Gly 


GAC 
Asp 
400 


I20C 


CGG 
Arg 


CGG 
Arg 
413 


GTC 
Val 


1248 


AA2 
Asr. 
43: 


Tyr 


ATC 
lie 


1256 


TGG 
— ~ 


AT 2 
lie 


ATG 
Me, 


1344 


GT.— . 

Vc.1 


CAA 
Glr. 


AC2 


"352 


GTA 
Ala 


His 


AAG 
Lvs 
48C 


1 4 4 Z 



148E 
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— GTA r-G GAT CCC GTG AGA CAT CAT CCC GTC GTG ATC GAG CTT 

-"I Glv VaT Leu Asp Pre Val Arc H,s Kis Pre Val Val lie Glu .eu 

" 500 505 

3 TT= AGA GAG CTG AGA AAA TTA CAA TTT AT 3 GTA GCC 

-vs evs Pne Phe Thr Glu Leu Arc Lys Leu Glr. Phe :,e -a 

-> s - - 52C 52 = 



515 



GA~ GCG GAT AAG TTC CAC GAC GAC GTA TGC GGC CTG TGG ACC GAA ATI 
A^p- lla Asp Lys Phe His Asp Asp Val Cys G,y ,eu .r? .... G„ _.e 



33: 

CTG TCC AAT CCC- GCT ATT AAA CCC AGC- GCC ATC AAC 

~?Z G : lie Leu Ser Asr. Pre Ala He Lys Pre Arc a* a — e -sr. 

I-; — - 550 555 



„ ™ T ~ GAG AG" CAC- TCT AAA GCA GTT AAT CAC C.A o—- 
;:.* v a Leu Glu Ser Glr. Ser Lys Ala Val Asr. His ^eu ^ o_u 

ACA TGC AGG GTC TAG 
Cvs Arc Val 

3 5-: 

(i: INFORMATION FOP SE2 IE NO: 15: 

SEQUENCE CHARACTERISTICS: 

( A ' LENGTH : 5 8 0 amine a ei ds 
(51 TYPE : amine acid 
(D* TOPOLOGY: linear 

Iii' MOLECULE TYPE: protein 

• SEDCZNCE DESCRIPTION: SEO ID NO: 15: 

Me- a" a Glu 31 v Giv Phe Gly Ala Asp Ser Val Gly Arc Gly Gly Glu 

Lys Ala Ser Val Thr Arg Gly Gly Arg Trp Asp Leu Gly Ser Ser Asp 

c.-r Ser Thr Thr Ser Thr Asp Met Asp As? Leu Pre 

" 40 -5 



g*- A~c Lvs Pre Leu Thr Gly Lys Ser Val Lys Thr Ser Tyr He 
— : c - 55 ec 

- a - p-c Thr Val Pre Thr Ser Lys Pre Trp His Leu Met Kis 

63 ' c . 

As- A— S— Leu Tnt Ala Thr Pre Arc Pne Pre Pre Arc Pre Leu lie 

E5 9: 

Arc His Pre Ser Glu Lys Gly Ser lie Pne Ala Ser Ar= Leu Ser Ala 

Thr Ase Asr Asp Ser Gly Asp Tyr Ala Pre Mel Asp Arc Pne Ala Phe 

115 - 2Z 

Gin Ser Pre Arc Val Cys Gly Arc Pre Pre Leu Pre Pre Pre Asr. His 

130 15 = 

Pre Pre Pre Ala Thr Arc Pre Ala Asp Ala Ser Mer Gly Asp Val Gly 
145 150 

Ala Ase Leu Gin Gly Leu Lys Arc Thr Pre Lys Gly Phe Leu Lys 
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L70 



160 



-■•v G'v Ser Leu Lvs Ala Arg Gly Arc Asp Val Gly 



Asi: Arc Leu Arg Asp Gly Gly Phe Ala Phe 

- c; 2 0 C 

Ser Ala He Gly Gin Asr. lie Lys Ser Trp 



= Ala Thr Ala Val Pre 

23C 



_ „ _ *..r 



Ser Pre Ar? Gly Val Lys 
205 

Leu Gly lie G-iy Giu Ser 
2 2 C 

Leu Me- Val Pre Val His 
- - ~ 2 -i C 



Val Asr Tvr Arg Asr. Val Tyr 
2 5 0 



Leu He Arg Thr Pre Val 1 
2 4 r 

Tvr Leu Glu Gly Val Met Gly Val Gly Lys Ser Thr Leu Val Asr. Ala 



265 



a'" Leu Pre Gir. Glu Arg Val Thr Ser Phe Pre Glu Pre 

• & " 111 — " 280 =2 = 



Me: Val Tvr Trp Thr Arg A* a 
~e- ' 255 



Phe Thr As? Cys Tyr Lys Glu He Ser 



30; 



-j-s L— Lvs Ser Glv Lys Ala Gly Asp Pro Leu Thr Ser Ala Lys 

3 *-j " 31*0 315 ->20 

- V c G'-r. Asn Lvs Phe Ser Leu Pro Phe Arg Thr Asr. Ala 

— "" 325 ' 330 335 

. - le Leu Ar= Met Met Gin Pro Trp Asr. Val Gly Gly Gly Ser 



34 C 



345 350 



G'v A-= Glv Thr His Trp Cys Val Phe Asp Arc His Leu Leu Ser Pre 



36 0 



... o'-o l°u Met His Leu Lvs His Gly Arg Leu Ser Phe 

I-! t& " 375 * 360 

._■-«= ^ o — Glr. Leu Leu Ser lie Phe Arg Ala Thr Glu Gly Asp 

" = C *" 390 355 ' i0C 



. - - -. . --. e - e - ^' — Leu Ser Ser Ala Glu Ser Leu Arg 
405 410 



;-i.r = ^ a » 

41- 



Arc Ala Arg Gly Arg Lys Asr. Asp Gly 

'•y'r* 425 



Thr Val Glu Gin Asr. 

43 0 



Trr A_ a 



Tvr His Ala Val Tyr Cys Ser Trp lie 
440 443 



4 5 C 



^ ; 



Gin Met Val Gin Leu Cys Val 
46: 



Ac - - - e s>~o Glu He Cvs Phe Arg Ser Val Arg Leu A.; 



Glu Glu Thr Leu Lys Asr. Leu His Glu Gin 



Ser Met Leu 



His Lvs 
4 80 



485 



490 



— Glv val Leu Asp Pre Val Arg His His Pro Val Val lie 
500 505 

Cys Phe Cys Phe Phe Thr Glu Leu Arg Lys Leu Gin Phe He 



Val Ala 



520 
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Asd Ala Asd Lvs Phe Bis Asp Asp Val Cys Gly Leu Trp Thr Glu He 

53C " D 

^ Arc Gin He Leu Ser Asr. Pro Ala lie Lys Pre Arc Ala lie Asr 
5 is " £50 

Trp Pro Ala Leu Glu Ser Gin Ser Lys Ala Val Asr. Kis Leu Glu Glu 



565 



Thr Cvs Arc Val 

ss: 

information for sec id no:1£: 
ii sequence characteristics: 

iA. LENGTH: Ilr3 base pairs 

(5 TYPE : nucleic acid 

■ - ■ STPJkNDEDNESS : single 

[Z TOPOLOGY : linear 

111. MOLECULE TYPE : DNA (genomic) 

{av Aim -SENSE: N 

■'ix- FEATUR.E : 

f A ) NAME /KEY: CDS 

(3; LOCATION: 1..21S3 

CD'. OTHER INFORMATION: 



txi; SEQUENCE DESCRIPTION: SEO. ID NO: 16: 

„~ G -- ~ G CC TTC TTG GCG GCC CTT GCA TGC TGG CGA TGC ATA 
Sir. Giv Leu Ala Phe Leu Ala Ala Leu Ala Cys Trp Arc Cys lie 

r * 5 io - s 

GG a g — ACT GGC GOG TTG CCG ACA ACG GCG ACG ACA 

Ser Leu Tr.r Cvs Gly Ala Tr.r Gly Ala Leu Pro Thr Tr.r Ala Thr Tr.r 

2 2 " 25 30 . 

... . _~ G ~ ACG CAG CTC ATC AAT GGG AGA ACC AAC CTC TCC 

He Tr.r lr= Ser Ala Tr.r Gin Leu He Asn Gly Arc Thr A sr. Leu Ser 
- z. 4 0 ^ - 

—A GAA —3 GAA TTG AAC GGC ACT AGT TTT TTT CTA AAT TGG CAA AAT 
G - -■ Glu Phe Asr. Gly Tr.r Ser Pne Phe Leu Asn Tr? «.n -sr. 

* — A" GAG CCG GCC CTG ACA GAG TTG 'TGC ACC TCC 

l;:: £r. ?*r ^ =-•- ?~ ~* ^ ^ - s - 

, E 70 7= 

— GAA G— G" GAG GAC CTC AGG GTA ACT CTG AAA AAG AGG CAA AGT 
rr- 3- va r Ala Glu Asr Leu Arc Val Thr Leu Lys Lys Arc G.r. Ser 

ft-O o-- - ^_ 9Q c- 



AAC AAG ACA GTT GTG ATC TCT GGA GAC GGC CAT 

Liu Phe Phe Pre Asn Lys Tnr Val Val lie Ser Gly Asp G.y His ~r= 

1 k; J w _ 

, G - G —g ACG TCG TCG CAA ACT TAT AAC ATC ACC AAG 

~- Cvs gTu Val P^o Thr Ser Ser Gin Thr Tyr A f r. lie Thr Lys 
-; 3 120 

GGC TTT AAC TAT AGC GCT CTG CCC GGG CAC CTT GGC GGA TTT GGC- ATC 



45 



14 4 



24 : 



26 = 



3E4 
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G*v Phe A sr. Tvr Ser Aia Leu Pro Glv Kis Leu Giv Gly phe Giy lie 
~" 130 * 135 * 140 

AAC GCG CGT CTG GTA CTG GGT GAT ATG TTC GCA TCA AAA TGG TGG GTA 4 5: 

As- Ala Arc Leu Val Leu Giv Asp He Phe Ala Ser Lys Trp Ser Leu 

145 " 15C 155 16C 

ttc ggg agg gag agg cga gag tat cgg gtg ttt tat cca atg aat gtc sz = 

Phe A" a Arc: As- Thr Pro Glu Tvr Arc: Val Phe Tvr Pre Me: A sr. Val 
* 1=5 17C 175 

ATG GCG GTC AAG TTT TCC ATA TGG ATT GGG AAC AAT GAG TGG GGG GTA = ~6 

n'. c ~ Val Lvs Phe Ser lie Ser lie Gly Asr* Asr. Glu Ser Giy Val 

is: iE5 iec- 

GTG GTG TAT GGA GTG GTG TGG GAA GAT TTG GTG GTG GTZ ?,ZZ CTG GAT = C4 

J^la Leu Tvr Glv Val Val Ser Glu Asp Phe Val Val Val Thr Leu His 
1=5 " 20C 2C5 

AAT AGG TGG AAA GAG GGT AAG GAG AGG GGG TGG GAT GTT GTG TTG GGT €~Z 

As- Arc Ser Lvs Glu Ala Asr. Glu Thr Ala Ser His Leu Leu Phe Giy 
2i: ' 215 Z2C 

GTT GGG GAT TGA GTG GGA TGT GTG AAG GGG GAT GGG AGG TAT GAT GAA ~ZZ 

Leu Pr: Asr Ser Leu Pre Ser Leu Lvs Giv His Ala Thr Tyr Asp Glu 

Z2z ' 230 235 240 

GTC AGG TTG GGG GGA AAG GGA AAA TAT GGG CTA GTG GGG ATG GTG GGT 7 6 £ 

Leu Thr Phe Ala Arc: Asr. Ala Lvs Tyr Ala Leu Val Ala lie Leu Pro 
245 250 255 

AAA GAT TGT TAG GAG AGA CTG GTT AGA GAG AAT TAG AGT GGG ATA TTT Sic 

Lvs Asc Ser Tvr Gin Thr Leu Leu Thr Glu Asn Tvr Thr Arc lie Phe 
2SZ 265 " 27 0 

GTG AAT ATG AGG GAG TGG AGG CGG CTG GAG TTG AGG CGG AGG ATG GAG 8 64 

Leu Asr. Me: T.-.r Glu Ser Thr Pro Leu Giu Phe Thr Arg Thr lie Gir. 
Z"5 2 B C 255 

AGG AGG ATG GTA TGA ATG GAG GGG AGG CGG GGG TGG GGA GGT GAA GAG 912 

Tr.r Ar= He Val Ser lie Glu Ala Arg Arg Ala Cvs Ala Ala Gir. Glu 

255 300 

GGG GCC- GGG GAG ATA TTG TTG GTG TTG TTT CAG ATG TTG GTG GGA GAG 96: 

Ala Ala Pre Asr He Phe Leu Val Leu Phe Gin Met Leu Val Ala His 

315 * 31C 315 3 2 C 

TTT GTT GTT GGG GGG GGG ATT GGG GAG GAG GGA TTT GTG GAG GTG GAG 100= 

='^e Leu Val Ala Arc Glv He Ala Giu His Arc: Phe Val Glu Val Asp 
325 " 33: " 335 

TG2 GTZ- TGT GGG GAG TAT GGG GAA GTG TAT TTT CTG GGG GGG ATG TGG 10 56 

Cvs Val Cvs Arc: Gin Tvr Ala Giu Leu Tvr ?.ie Leu Arg Arg He Ser 

340 " 345 35: 

GG" GTC TGG ATG GGG AGG TTG AGG AGT GTG GGG TAT AAC GAG AGG AGG 1104 

Arc Leu Cvs Ke: Pro Thr Phe Thr Thr Val Gly Tyr Asr. His Tr.r Tr.r 
255 360 365 

GTT GGG GGT GTG GGG GGG AGA CAA ATA GGT CGG GTG TGG GGG AGG AAG 115 2 

L«u Glv Ala Val Ala Ala Tr.r Gir. He Ala Arg Val Ser Ala Thr Lys 
37': 3" 360 

TTG GCG AGT TTG CGG CGG TGT TGG CAG GAA ACA GTG CTG GGG ATG GTG 12 CC 

Leu Ala Ser Leu Pro Arc; Ser Ser Gir. Glu Thr Val Leu Ala Me: Val 

"i £ = 390 395 40C 
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- -5 



-A3 err GoC ggg cgt gat ggg ggg 

^ir. Leu Giy Ala Arg Asp G.y ^s. 



TGG TCC A-- w » *J \3*\^S OU. 

Vai Pro Ser Ser - - - Leu Giu Gly 

4lG 415 



ATT GCT ATG 3T- 

i.ls Me: Vai Vai G-lu 
42C 

AGA GTG GGG GAT ATT GAA 
Tr.r Leu Giy Asp Tr.r G-u 



CAT ATG TAT ACT G-- ~- - « • r * 

His Me: Tvr Tr.r Aia Tyr Tr.r Tyr wa_ . yr 



AG A AAA TTA ATG TTG GAT ATA GAG ATG GTG 

Arg Lys Leu Met Leu Asp i-£ .-is .r.r v<=. 



AAA GAG TGG GGA GTA TGA GAr-. 
2Z; ^li ^-c Pro Lvs Aso Ser Giy Vai Ser Giu Lys Leu 



AGA AGA TA. - . --^ — « JJ-^ As _. _ l£ .... Le „ 



4-C 



4b; 



. _ — - .._ ... CGG GAG AGG CTT - 

G"-- lie Aia Arg Phe Ser Lys Pro ~*p «i - 



«. = 3 



490 



"is Pr.e Ser Pre- Cys Phe Leu Giy Leu Arg Tyr Asp Leu 



"7 CTA GGA GTA AGG TA- Gn.. 



soc 



505 510 



AAG ~TG CGG GGG GAG GGG CCG CAG TGG TGC GCT GTG AGG 

-I ~« L-u Arc Aia Giu Aia Pro Gin Ser Ser Aia Leu Tr.r 

= r E - £20 



His Pro 



— 



:GG GTT - njft ^J\Ji-k 

v la Vai A- a Arg «-y 

5 3 0 



AGA TGG GGA TTG GGA GAA TTG GTG GA_ 
ser Giv Phe Aia Giu Leu Leu His 
535 * 540 



O w J u 

Aia Leu 



CAG GTG GAT AG G TTA AAT TTA AT. C^ G_o ~ * . _ 

lie Pro Axa lie Asr. Gys Ser -\ s -ie 

c;; 56 C 



His Leu Aso Ser Leu A sr. Leu 
" 5 50 



146 = 



1584 



1631 



16 8 0 



G ATA ATA GCT AGG GTA CCC TTG GCT CAG GTG AGG TAT - ■ . e 

-"■e Aia Tr.r Vai Pre Leu Pro His -a. .r.r . yr 
570 5 " 5 
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-7T GAT AAT AAC AAC CTA CAC ATT CAT TAT TTG TGG CTG AGG GAC AAC 
?Ae Asr Asr. Asr. Asr. Leu His lie His Tyr Leu Trp .eu ~rg Asp As.. 
£7= ^ 8 C 

GG~- A" GTA GTG GAG ATA AGG GGC ATG TAT AGA AGA CGC GCA GCC AGT 
-r. Val Glu He Arc Gly Mer Tyr A^g Arg Arc A_a ~-a Ser 

l^l LkZ Phi Leu" lie Leu Ser Phe He Gly pie Ser Gly Val He Tyr 
7CE 710 

„ , —a TTG TTT TCC ATC CTT TAT TAG 

on. £eu Tvr Arc Leu Phe Ser He Leu Tyr - 
7 2 5 73 C 

;i; INrOP-MATION FOP. SEC. ID NO:!": 

i SECUENCE CHARACTERISTICS: 

(A'« LENGTH: 73 0 ammo acids 
{ b ; TYPE: ar.ir.o aoic 
;c; TOPOLOGY: linear 

■.ii' MOLECULE TYPE: proceir. 

ixi SEQUENCE DESCRIPTION: SEQ IL NO: 17: 

Mo: Glr. Glv Leu Ala Phe Leu Ala Ala Leu Ala Cys Trp Arg Cys He 

5 10 - r 

Ser Leu Thr Cvs Gly Ala Thr Gly Ala Leu Pro Thr Thr Ala Thr Thr 
20 25 30 

Ser Ala Glr- Leu He Asr. Gly Arc Thr Asr. Leu Ser 

— ;i 40 45 

He Glu Leu Glu Phe Asr. Gly Thr Ser Phe Phe Leu Asr. Trp Glr. Asr. 



5: 



5 5 60 



- to-- as- va" He Thr Glu Pro Ala Leu Thr Glu Leu Trp Thr Ser 
"H " " 7C 75 8C 

-a Val Ala Glu Asr Leu Arg Val Thr Leu Lys Lys Arc Glr. Ser 

^ e5 90 95 

Leu Phe Phe Pro Asr. Lys Thr Val Val He Ser Gly Asp Gly His Arc 

io: ic-= 

G ^u Val Pre Thr Ser Ser Glr. Thr Tyr Asr. He Thr Lys 

III ' 12C H5 

-.. as- — Ser Ala Leu Pro Gly Kis Leu Gly Gly Phe Gly He 

n: * " -35 i4C 



145 

Phe Ala 



Asr. Ala Arc Leu Val Leu Gly Aso He Phe Ala Ser Lys Trp Ser Leu 

150 ' 

Arc Aso Thr Pro Glu Tyr Arg Val Phe Tyr Pro Mer Asr: Val 

165 - 7C * ' 



Me, Ala Val Lys Phe Ser He Ser He Gly Asr. Asr. Glu Ser Gly Val 

1B0 1£ - _y 

A* a '-u Glv val Val Ser Glu As? Phe Val Val Val Thr Leu His 

,05 * 200 22= 
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Asr. Arg Ser Lys Giu Ala Asr. Glu Th: 



Leu Pre 



Asc Ser Leu Pro Ser Leu Lys 
230 

?ne Ala Arg As- Ai 
245 



a Lvs Tvr 



Ala Ser His Leu Leu Phe Giy 
220 

Gly His Ala Thr Tyr Asp Glu 
225 

Ala Leu Val Ala lie Leu Pre 
250 255 



Lvs Asr 



Ser Tyr 



As- Me- Tr.r «iu 



Ser Tr.r 



?r : 
28; 



2c: 
Leu 



- lie Val Ser lie Glu A- a Arc Arc 



Phe Thr Arg 
"'5 5 

Ala Cys A- a j-i-S 
3 00 



_~e 



i'a -a ?r: Asr lie Phe Leu 
3D 5 " 310 

Pr.e Leu Val Ala Arg Gly He 
2 2 5 

Cvs Val Cvs Arg Gin Tyr Ala 
340 



Val Leu 



Ala Glu 



Phe Glr. Me: Leu Val Ala His 



His 
330 



j 1 — 

Arc Phe Val 



32C 
Val Asp 

~ "S " 



Glu Leu Tyr 
345 



^c L»u Cvs Met Pre Thr Phe Thr Thr 

360 



2 7C 



355 

Al a Val Al a Al a Thr 
375 

Ser Leu Pro Arg Ser 
390 

Glv Ala Arc Asp Gly 
405 



Phe Leu Arg Arg lie Ser 

3 5 0 

Glv Tvr Asr. His Thr Thr 
365 

Arg Val Ser Ala Thr Lys 
3B0 

Thr Val Leu Ala Met Val 
395 400 

Ala Val Pro Ser Ser lie Leu Glu Gly 



Glr. He 



Val 
Ala 



Ser Glr. Giu 



Pro 
410 



lie Ala Met Val Val Giu His Met 



Tvr 
425 



Ala Tyr Thr Tyr V £ 
43 0 



Gl"-* Asr Thr G 



Glu Arc Lvs Leu Met 



Asr Ser Cys Pre Pre 
455 

-v.- 7-— L,eu Met Phe 

4 70 

Met He Ala Arg Pr.e 
4S3 

Phe Ser Pro Cys Phe 



Lvs 
440 

Lvs Asc Ser 



Leu Asp lie His 
44 5 



Tr.r Val 



Thr Ser Met 



Ser Lvs Pre 
4 90 



Gly Val Ser Giu Lys Leu 
4 6 0 

Cvs Thr Asr. lie Giu Leu 

475 480 

Asr Ser Leu Asr. lie Tyr 
4 55 



Leu 



5 0 5 



Leu Arg Tyr Asp Leu His Pre 



Ala Lvs Leu r.r^ ^ 



Arc Ala G 



lu Ala Pre Glr. Ser Ser Ala Leu Tr.r Arc - r.r 



5 3 0 

His Leu 
5 4 r 



, n~ \' * T * U - c o - 



Asu Ser Leu Asn Leu 
550 



.a Asr Lvs lie lie 



lie 



Phe Ala Giu Leu Leu His A- a Leu 
54 0 

Pro Ala lie Asr. Cys Ser Lys lie 
5 55 56 ' : 



Thr Val Pre Leu Pre His Val Thr Tyr 
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565 

5B0 



570 57£ 

565 



He He Ser Ser Glu Ala L«u Ser Asn Ala Val Val Tyr Glu Val Ser 

5B0 5E:) 

G-u He ?he Leu Lys Ser Ala Me: Phe He Ser Ala He Lys Pre Asp 
555 500 



:ys ser Gly Phe Asr. Phe Ser Gin He Asp Arc His He Pre He Val 



£10 



Tvr ^r. He Ser T,r Pre Arg Ar= Gly Cys Pre Leu Cy, Asp Ser Va. 
:le k«-. Ser Tyr As? Glu Ser Asp Gly Leu Glr. Ser Leu _ 



Si: 



. Va - 31- zr.r Asr. Leu Phe Leu Asp Lys Ser Fr: Phe 

*" *~ 6 6 0 " "5 S7C 

Phe Asp Asr. Asr. As, Leu H:s lie K,s Tyr Leu Trr Leu Ar= Asp As, 

va - Giu lie Arc Giv Met Tyr Arc Ar= Ar= Ala Ala Ser 

a Leu Phe Leu lie Leu Ser Phe lie Gly Phe Ser Gly Val lie Tyr 

■7 1 ri 71d ' - ^ 



705 7 ^ 715 

Phe Leu Tvr Arg Leu Phe Ser He Leu Tyr 
725 730 

::: information for seo id no:18: 
ceduskce characteristics: 

■:a LENGTH: 121= base pairs 

'H 1 TYPE: nucleic acid 

;C STRANOEDNESS : single 

;c TOPOLOGY: linear 

; i: KOLECVLE TYPE: DNA (genomic) 

.111' HY? 2THETI CAL : N 

; ;v Aim -SENSE: N 

;:x FEATURE: 

NAME -'KEY: CDS 
(5 LOCATION : 1 . . 1215 
; 2 C THE?- I Nr ORMAT I OK : 

:>:i SEI'JEKIE DESCRIPTION: SE2 ~- NO: IE: 

. — - -A~ GTG AAG GOT ACT CTA OTA GAG GCC GCG GCG 

rS- lr~ Val Pro Asp Val Lys Ala Ser Leu Val Glu Gly Ala A-a 
1 5 

G3C GAG C3C GTG TTT CAC GTC TTG ACC TCT CCG GCG 

Ari Leu Sir Tnr Gly Glu Arc Val Phe His Val Leu Thr Ser ,rc --a 

-? ?T a «? Sal SJ val SeT S Val S 5£ frS S 

TTG TTC GAA AAG TTT GGG ACT CCG GAC TCG TCT ACC GTG CCA CTC TAC 
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Leu Giu Lvs Phe Glv Thr Pro Asp Ser Ser Thr Leu Pre Leu Tvr 

5 0 5= 6 0 

GCG GCT AGG CAC GCG GAA CTA TCG TTG CTA CGG ATT ATG CTC TCA CCG 
A' £ Ala Arc His Pre Glu Leu Ser Leu Leu Arc lie Me: Leu Ser Pre 

£= " 7 0 75 EC 

CAC CCC TAG GCG TTA AGA AGC CAC TTG TGC GTA GGC GAA GAG ACC GCA 
His Pre Tvr Ala Leu Arr Ser His Leu Cvs Val Giy Giu Glu Tr.r Ala 

6 5 s: * 9 5 



. 



;C TTG CAC ZZZ AAG CCA GTC GTA CGC GGC GAT GA; : :• t 

Ser Leu Civ Val Tyr Leu His Ser Lys Pro Val Val Ar= Gly His Glu 
10C ' 105 1IC 

TTG GAG GAT ACG GAG ATA GTA GCG GAG TGG CGG CTG GGC ATA AC 3 AGG 3 54 

Phe Glu Ast Tr.r Glr. lie Leu Pro Giu Cys Arg Leu Ala He Thr Ser 
115 12 0 * " 12 5 

GAT GAG TCT TAT ACC AAG TTT AAG ATT ATA GAT ZZZ CCA GCG GGA TGG 421 

Asr Glr. Ser Tvr Thr Asr. Phe Lvs lie lie Asr Leu Pre Ala Glv Cvs 
13: ' 125 * ' 14C 

CGT CGC GTC ZZZ ATA CAC GCG GCG AAG AAG CGT GTC GTC ATC GAC GAG 48C 

Are Arr Val Pre lie His Ala Ala Asr. Lvs Arc Val Val He Aso Giu 

145 " 150 * 155 ~ 160 

GGC GCC AAC CGC ATA AAG GTG TTT GAC CCA GAG TCG CGT TTA GCG CGT 526 

Ala Ala Asr. Arc lie Lvs Val Phe Aso Pre Giu Ser Pro' Leu Pre Arc 
165 * * 170 175 

CAC ZZZ ATA ACA CCC CGT GCC GGT GAG AGC AGA TCT ATA CTG AAA CAC 5 76 

His Pre He Thr Pre Arg Ala Giy Gin Thr Arg Ser lie Leu Lys His 
160 185 " 190 

AAC ATC GCA GAG GTT TGC GAA CGG GAT ATC GTG TCA CTT AAC ACA GAC 624 

Asr. lie Ala Glr. Val Cys Giu Arg Asp He Val Ser Leu Asr. Thr Asp 
195 ' 200 205 

AAC GAC GCC GCG TCT ATG TTC TAG ATG ATT GGA GTC AGG CGG GCG AGA £72 

Asr. Glu Ala Ala Ser Met. Phe Tyr Met lie Gly Leu Arg Arg Pro Arg 
2i: 215 ' * 220 

ZZZ GGA GAA AG 2 CGG GTC TGT GAC TTC AAC ACC GTT ACC ATC ATG GAG 72 C 

Leu Glv Glu Ser Pre Val Cys Asp Phe Asn Thr Val Thr He Met Giu 

115 * 230 ' 235 240 

CGT GCT AAC AAC TCG ATA ACT TTT CTA CCC AAG CTA AAA CTG AAC CGG 76 6 

Arc Ala Asr. Asr. Ser He Thr Phe Leu Pre Lys Leu Lys Leu Asr. Arg 
245 25C * 255 

CTA CAA CAC CTG TTC CTG AAG CAC GTG TTG CTG CCC ACC ATG GGG CTG BIS 

Leu Glr. His Leu Phe Leu Lvs His Val Leu Leu Arg Ser Met Gly leu 
26: * 2£5 270 

3AA AAC ATC GTC TCG TGT TTC TCA TCG CTG TAC G2-Z GCA GAA CTT GCC 6 64 

Glu Asr. He Val Ser Cvs Phe Ser Ser Leu Tvr Glv Ala Giu Leu Ala 
"5 26 0 255 

ZZZ GCC AAA ACA CAC GAG CGG GAG TTC TTC GGC GCT CTG CTA GAA AGA =12 

Pre Ala Lvs Tr.r His Giu Arc Glu Pr.e Phe Glv Ala Leu Leu Glu Arg 
19: * 293 300 

GTC AAA GGT CGG GTG GAG GAC GCG GTC TTC TGC CTG AAT ACC ATA GAG 96 0 

Leu Lvs Arc Arc Val Giu Ase Ala Val Phe Cvs Leu Asr. Tr.r He Giu 

305 * " " 310 * ■ 315 32C 
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Le — .r.r Va_ 



INFORMATION "OF. SE~ H> NO: IS: 

: - > SEQUENCE CHARACTERISTICS : 

(A- LENGTH: 4 04 amino acids 
(a) TV PE : ammo acid 
CD) TOPOLOGY : linear 



{hi MOLECULE TYPE: protein 
v; SEOUENCE DESCRIPTION: SEC 10 NO: 19: 



Met Leu Arc \a. 



.re se: 



z>- 0 asd Val Lvs Ala Ser Leu Val 
5 ' 10 



Glv GIu Arc Val Phe His Val Leu Tr.r 
25 



Ser 

3C 



Aia A-.a 



Pre Ala 



Val A" a A" a Me- Val Glv Val Ser Asn Pro Glu Val Pre 

Leu Pr.e Glu Lys Phe Gly Thr Pro Asp Ser Ser Thr Leu 
- -. » » 6 'k 



Met Pro Leu 



Pro Leu Tyr 



Ala Ala Arc His Pro Glu Leu Ser Leu 



Leu Arc lie Me: Leu Ser 



Pro 
8 0 



His Pre Tvr Ala Leu 



u Arc Ser His Leu Cys Val Gly 



vj_U o - U - o 



'c«- Val Tvr Leu His Ser Lys Pre Val Val 



Phe Glu Asp Thr Glr. He Leu Pro 



t. v _ 7-r Asr. Phe Lys He He Asp Leu Pr 

- ~ - ill 



Cvs 



v-a 1 P-c He Kis Ala Ala Asr. Lys Arc Val Val 



I.e Aso 



Glu 
16 C 



Ala Asr. Arc He Lys 



Val Phe Asr Pro Glu Ser Pro Leu Pro Arc 
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21r 

1€5 - 5 

«-s P— He Thr Pro Arc Ala Gly Glr. Thr Arc Ser He Leu Lys His 
1B0 1E5 

As- -■- -la Gin val Cys Glu Arg Asp He Val Ser Leu Asr. Thr Asr 
" 195 2 0C 2 0 = 

Asr. Glu Ala Ala Ser Met Phe Tyr Met lie Gly Leu Ar? Arg Pro Arg 
L-u Glv Glu £er Pre Val Cys Asp Phe Asr. Thr Val Thr lie Me, Glu 

- 25 ' 23: 

Ar= Ala Asr. Asr: Ser He Thr Phe Leu Pro Lys Leu Lys Leu Asr Arg 
24 5 -=»C 

C His Leu Phe Leu Lys His Val Leu Leu Arg Ser Met Gly Leu 
— — 260 265 -70 

Glu Asr. lie Val Ser Cys Phe Ser Ser Leu Tyr Gly Glu Leu Ala 

2 - 5 2 8 0 * = - 

^ — - His Glu Ar= Glu Phe Phe Gly Ala Leu Leu Glu Arg 

2sz ' 255 300 



Leu 
3 05 



*vs A-= A-= Val Glu Aso Ala Val Phe Cys Leu Asr. Thr He Glu 

- 3 , 0 315 3 2 0 

Asr Phe Pre Phe Arg Glu Pro He Arg Glr. Pro Pro Asr Cys Ser Lys 

325 330 335 

va' Leu He Glu Ala Met Glu Lys Tyr Phe Met Met Cys Ser Pro Lys 

°* - 34 _ 2S0 

^ c 3,,- Ser Ala Ala Trp Leu Gly Ala Gly Val Val Glu Leu He 

— As- "-" v Asr. Pro Leu Ser Glu Val Leu Gly Phe Leu Ala Lys Tyr 
37 'c 375 380 

M o- r>~- * Glr. Lvs Glu Cvs Thr Gly Asr. Leu Leu Lys He Tyr Ala 

3 65 "3 90 3 55 4 00 

Leu Leu Tr.r Val 



t KF: . ? vjl-io:: for sec :d nc;20: 

SZw'JiNCE CHARACTERISTICS: 
(A LENGTH : 2255 base pairs 
{ c • TYPE: nucleic aoic 
(2; STRAirCEDKESS : single 
!2>: TOPOLOGY: linear 

:n KCLECCLE TYPE: DNA (ger.or.it ; 

Airri - SENSE : K 

( ix r EATURE : 

(A) NAME / KEY : CDS 

( E ) LOCATION: 1 . .2259 

(2) ! OTHER INFORMATION : 
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;v : .. SS3-JSN=£ DESCRIPTION: SEQ ZD NT: 2 0: 

— - G~ G~G ~TG GAG GST CCC CTA CTA TTC CCA TCG AGT GCG TTC CTG 

Ala* Ala Leu Giu Gly Pre Leu Leu Leu Pro Pro Ser -a Ser .eu 

_ 5 10 — 

- — . A"" AG" TTG CAG ATT ATG TTT TAT CAA GTG ATT TGG GAA TTA CAT 

~~ Se"r Pro Gir. Tr.r Tr.r Tys Tyr Gin Ala Tr.r Trp G.u Ser G.r. 

- -r~- GTG GTT AC 3 AAT TTG TAT TTT TAG GTA GAG 

^ g~ liZ Pie Tvs Cys Leu Ala Thr Asr. Ser K.s Leu G.r. Ala Glu 
"""35 * 4C 

, A GAA GGT TTT GAT AAG ATG ATG TAG TTC GAG TTC AGG TTT 

r;r: -I: ^ G~ Glv Leu As? Lys Mer Me: Gin Pro Glu Pro Tr.r Pne 

"V; " 5 5 6C 

~~ ^ GCG ATA CGC AGA CTA TTT TTG GGG GAA CGT TTC CAC 

d-;; Ala" Tvs Arc Ala He Arc Arg Leu Leu Leu Gly Glu Arc Leu His 

-V; 7C 75 8C 

„. -aa GAA G3G ATT TTT TTG GGA AAA GTG GGT TGA TGG 

Z LS Gin Glu Gly Tr.r Leu Leu Gly Lys Val G.y Arc Arg 

'"" "* 65 90 5 = 

C-G- GAA GGT TTA ATA ATT GAT GGT GGT GGA GTG TTT ATG TGT 

fvr Ser Gly Glu Gly Leu He He Asp Gly Gly Gly Val Phe Tr.r Arg 



10C aos 

GGA CAG ATA GAT ACC GAT AAT TAT TTA CTT GCG GTG GGA TTA TGG GAA 

T?Z ~- ~ ~ Aso Tr.r Asr> Asr. Tyr Leu Pro Ala Val Gly Ser Trp G.u 

"* 115 * 120 125 

, a — GA"" G>~ TGT GAT AAA CCC TGC GAA TTG AGG GAG TTA CGG TTG 

"I" ~~Z - e - aso Tvs Asd Lvs Pro Cys Giu Phe Arc Glu Leu Arc Ser 

HI ------ 5 14 C 

, , GCG CTA TTA ATG TGC ACC ATA TGT TAG AAA GTG ATG 

-f:; £ e " Ala Leu Leu Tr.r Cvs Thr lie Cys Tyr Lys Ala Met 

7.: — ' 15 C 155 ISC 

AG" A "A G"G TGG AGG TAT TTG GAG TTG TGG GAG TTC GAA CAG TGT 

?r.e Arc lie Val Cys Arg Tyr Leu Glu Phe Trp Glu Phe Giu Gin Cys 



l£5 



170 " - I "? 5 



, ~-~ ■ — G GCG GTG TTG CCC CAT AGT CTA CAA CCG ACA ATC 

52s Ala ?r.e Leu Ala Val Leu Pre H.s Ser Leu Gir. Pro Tr.r He 
ISC 165 -9- 



GGA GTG TTG GAG AGG GTG AAG CAT CT! 

IT'.! -'"I' a-- ^ Phe Ala Leu Leu Glu Ser Leu Lys K.s Leu Ser Pne 

-I; ' 2 0C 2 05 

_ — ^ — GGA TCG CCA GAG GGA CAG CTA CAT TTT TTA AAG 
:;C C;;: Pro Ala Ser Pro Asp Ala Gir. Leu K-£ ?he Leu Lys 
215 ~ 



- « - AG ATG GC2 AGG TGG GGG TGG CAG GGA GAG TTG 

Pne 
225 



Phe ^ Ser Ser Phe Me: Ala Tr.r Trp Gly Trp K.s Gly Glu Leu 



— G C3r C3T GCG ATG GTT CAG AAC GTA GAG GGA CTG CCC ACC 

val sir Leu Arc Arg Ala lie Ala K.s Asr. Vai Giu Arg Leu Pro Tr.r 

24 5 25C ' 

GTG CTG AAG AAG TTG TCG AAA CAG AGT AAG CAG CAG GAG GTG AAG GTT 



28 = 



3B4 



432 



48 = 



52S 



57£ 



765 



Elr 



pcm : s95/ioi<M 
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val Leu Lvs Asr. Leu Ser Lys Gin Ser Lys His Gin Asp Va! Lys Val 

26 o 



25 C 

rG3 GA - G TG gtg GGG TTT CAG CTG GCT CTA AAC CAG CTG GTG 

^ S~ A~ Asp Leu Val Gly Phe Glr. Leu Ala Leu As, G_r. - Va. 



:~5 

^ G G AG GTA AAA ATT CAA CGG AAG GAG CGC GGA CGA AAG CCA 

Ser A~ Leu His val Lys lie Gin Arg Lys Asp Pre G.y Pre -ys Pre 

-A" AGG GTG GTG GTG AGT ACG CCA GAT TGT AGG TAG TAT GTA GTG TAT 

" ~ Val --- -~_ . . 

305 31C 315 



^3 7-- val Val Ser Thr Pre As? Cys Thr Tyr Tyr Leu Val Tyr 

Sbi = 

-~ rrG G GG ATG TAG AGA GTG GTG ATG TGT ATG GGA GTG GGA 

g v ~ :J p-= — a :le T " r Ars Leu Val M£ " Cy£ Me ~ Ala - : "~ a 

330 = 



325 



_ »r G - GGG GAG TGG TGG AGG GGA GTG CAG GGG TGG GGA AAG TTT 

1 Glv Kis Ser Cvs Ser Gly Leu His Pro Cys A.a Asr. Pne 

w - ~ * 345 35 0 



34C 



GA C- AGA GGG CGT CTG CTG GGG GGG AGG CTT 

G r;. T ZZ : I- £ G iu Thr Pro Arc Leu Leu Ala Ala Tr.r Leu Ser Arc 



255 



360 3£5 



„ G - GGG GGG AAA GAG GGG CGA GCA GGG ATG AAA GGA AAT TTG 

-1- A-5 Tvr Ala Pro Lys Asp Arg Arc Ala Ala Mer Lys G:y Asr. -e- 
* * 275 * 375 3B0 

"AG G-" TGG TTG CAA CGA TAC GGG GGG AGG GAG GGG CGG ACT GTG GGG 

5- l"a Cvs Phe Gin Arg Tyr Ala Ala Thr Asp Ala Arc: Tr.r Leu oly 

3-' *• 390 3 95 4 00 

A=A 3-G TC a GAG ATG CTG GAA CGG AGA AAA CAG GTG AGT TTG 

ell — va- Ser Asd Me: Leu Glu Pro Thr Lys His Val Ser Leu 
" 405 " 410 415 

_ AAG A TC AGG ATA TTG AAG AGG AAG ATG GTG ATT AAG AGT 

°~ o^I t V£ -i» T.-r lie Phe Asn Thr Asr. Met Val lie Asr. Tr.r 

X'2Z 425 430 

_ -~ — - GTT CGT AAC ACC CTG CAA AAG ACT ATT TTA AAG 

- " - : 2 cvs His Val Pro Asr. Thr Leu Gin Lys Tr.r lie Leu Asr. 

435 440 445 

AGG AAG AAT TTT GTT ATA GGA AAG TAG TGG GTA AAG 

a-= Leu Thr Asn Asa Phe Val lie Arg Lys Tyr Ser Val Lys 
" \\1 ' 455 460 

, AGG ATA AGG GTG TTT TTT TGG GAG AAG ATG TGT GA« 

~~ Thr Gle Ser Val ?ne Phe Ser Asp Asn Me: Cys ^_r. 

Ill " 47C 475 4B. 

AAG ATG A=,G ATG AGT GGG GAG ATG GTG GAG TTT GTG 

G- — Al2 G^e" £r. Gle Asn lie Ser Gly Asp Me: Leu His Pne -eu 

AGG CTG AAA TGG TTT CTG CCA ATG AGG GAG ATA TTT 

?Ae XI2 Gly T,r Leu Lys Cys Ph ? Le, Pro lie Arg K,s -e ?.,e 

50: 50= 

— GTA TGG ATA GCA AAT TGG AAC TGG ACG TTG GAG CTG CAG GGA CTG 
Pro VaL Ser lie Ala Asn Trp Asn Ser Thr Leu As? Leu his «... 
515 52w 



1200 



124S 



1296 



1344 



148S 



15B4 
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~ -"rr: ti" ATG GTG AGA ATG GGG CGA AAA AAG G" 



r\ - - - 



/-v 



^ 3"" M-t Val Arc Met Glv Arg Lys Asr. Val Phe Trp T: 



= 5 w 



— — ^ GTG GTG TCv «o _ nno -.^ >j - «- - — 

el- Val Val Ser Ser Lys Asp Gly Leu Asr. Val Ser 
S5G 555 5c: 



^r%_ . - 

Thr Asr. ?.ie Pr: 
54 5 

GZZ GGG AGA GGG AGG ATT TTT AAA GTG TAG ~- 

.0-- ~- — ^ V s Val Tvr G-"-* G.r. ?r: 

~_ p>> e LV£ rt-S . - 

56 = 



GAG GAG GTG GGG GGG ATT C.- ^ 
His Glu Leu Ala Prr He Leu _r.r 
5E5 ssz 



T ^ r ".s r G — r. 



H - 5 A- a 



-3 - A*"" GAG GGA AAG AAA AAT AGA ATA - - - _ - 

as- Glv Asr. Lvs Asr. Arc He Phe Ser Leu „eu 

-i= " SOD SC5 



TA GAG AGG GTA GAG AAA AGG . - - w . « «~^> 



His Arc As- A r g 



TG- wrtrt AiA U-J-KJ /-i^U 

G" - He Glr. Tr.r Leu His Lys Arg 

62C 



Phe Leu G_u 



Cys _eu 
£ 2* 5 



■^3C TGT TGG TTT GTG AGG C. * GAG GTG GGT 
~-~ ■ r- vs cys Ser Phe Leu Arg Leu Asp Val A- a 
* 63C 635 



Gvs He 



A"" -G=> GGG GGG GGG GGG GGG CTG TTT GAG TTG TGA AA< 
-I lr= Ala Ala Ala Arg Gly Leu Phe Asp Phe Ser Ly 



s Lvs He He 



= ~— ?:r ~ = - esc 6=5 



645 



AGG AAA CAG GAG TGG GGA GTA GTG GGA 
S*»- Lvs His Glu Cvs Ala Val Leu Gly 
~~ " 665 S70 



AG. GAT AGT AAA _ , ^_ , T _, T _ ^, _ 

=-er His Tr.r Lvs 
66C 



TGT AA_ - - w^-j 



ATG TAT GGG GGA AAG AAG AAG AGG AGG 
Pre Lys He Tvr Ala Arg Asr* Lys Lys "3" 

6*eo ees 



-* - GA™ 1 GGG GGG AAT GGA AAG TTG A.- TGu G._ 

7~J G _v Ar= Asr. Ala Asr. Phe He Ser Phe Va. Aia .r.r 



6 95 70C ' 



7 GTA AAG GGA GAA A. . GTG GGT 

Glv his Arg Pne Ala Ala Leu Lys 



GG. ----- ^" . . pro Gln Iie Val Ar r His Ala 



3 G 7 AAA. ^ 00- - - — 



TGG GGC GAG CGA AGG GGG GGG TGI 
Trx Arg His Arg Tr.r Ala Ala Ser As- Giu 

73C 735 



-A" PGA GGG GGA GGG GAT GGC GGG GTA GGT TGG GTG GG. GZG GTG « r 
Glr. Tr.r Pre ?r^ Ala Asp Pro Arg Val Arc Gys va. ~rg va. 



- c z - 



166: 



1672 



isr: 



156£ 



AAG 2016 
Lvs Lvs 



206^ 



216C 



220E 



IKFOPJ-'.ATIGi; FOr. SE; 13 KO:21: 

ii> SSOUSKGi GKAKAGTER.I ST I GS : 

( A 1 LENGTH : 7 52 amir.: acids 
(3; t;?I: amino an- 
;E - TOPGLOGV: linear 
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Iii) MOLECULE TYPE: prore^r. 







i': SEQUEKCE DESCF.1PT 


I OK : SEw 




NO : 2 












Met ; 


Ala - 


Ala Leu Glu Gly ?ro 

c 


Leu Leu 


Leu 

10 


Pro 


Pro 


Ser 


A— a 


Ser 


Leu 


Thr ' 


Thr 


£er Pre Glr. * — 


Cys Tyr 




Ala 


* .n 


Trp 


\jl U 

3 : 


Ser 


G ir. 


L-eu 


Glu 


~le Phe Cvs Cys 1s- 


Ala Thr 
4 C 


Asr. 


Ser 


His 


Leu 


G_r. 


Ala 


u — u 




5 C 


vo-j G 1 — G — y ■J-'S — p 


Lys Met 


Met 


u in 


Pre 

s: 


\j — u 


Pro 




Phe 


Phe 


A_ a 


~vs Ar- Ala lie Arg 

7 Z 


Arg Leu 


Leu 


Leu 
75 


Gly 


Glu 


Arg 


Le u 


His 

6 :■ 


Pr 0 


Phe 


lie His Gin Glu Gly 

es 


Thr Leu 


Leu 
9G 


Gly 


Lvs 


Val 


Gl v 


Arg 
c = 


Arg 


Tvr 




ic: 


lie Asp 


Gly 


Gly 


Gly 


* a — 


Phe 


Tr.r 


y-i.r g 


•J 1 V 


G-.r. 


lie Asp Thr Asp Asr. 
115 


Tvr Leu 
120 


Pro 


Ala 


Val 


G i y 


Ser 


T ro 


\j 1 u 




13 0 


Asr Ast> Cvs Asp Lys 
135 


Pro Cys 


Glu 


Phe 


Arg 
140 


Glu 


Leu 


Arg 


Ser 


Le U 




L*»u Pro Ala Leu Leu 
150 


Thr Cys 


Thr 


lie 
15 5 


Cys 


Tyr 


Lys 


Ala 


Met 
160 


Phe 


Arg 


lie Val Cys Arc Tyr 
165 


Leu Glu 


Phe 
17 0 


Trp 


oi-U 


Phe 


Glu 


G ir. 

175 


Cys 


Phe 


H i. s 


Ala Phe Leu Ala Val 
ISC 


Leu Pre 
185 


His 


Ser 


Leu 


Glr. 


Pro 
ISC 


Thr 


lie 


Tvr 


o — r. 


Asr. T-.-r Phe Ala Leu 
1 9 z 


Leu Glu 
200 


Ser 


Leu 


Lys 


His 
2 0 5 


Leu 


Ser 


Phe 


Ser 




Me_ Pre ?r: Ala Ser 
115 


Pro Asp 


Ala 


Glr. 


Leu 


His 


Phe 


Leu 


Lys 


Phe 


Asr. 


lie Ser Ser Phe Met 
2 3 Z 


A_ a Tr.r 




Gly 




His 


G i y 


G — u 


Leu 
2 4 C 


N. C. — 


Ser Leu Arg Arg Ala lie 


Ala His 


Asr. 

25: 


val 


G — u 




Le u 


Pro 




V a. - 


lie *«. 


Lve Asr. Leu Ser Lys Glr. Ser 
* 25: 255 


■ Lys 


Hi s 


G_r. 


Asp 


Val 
2 7 C 


Lys 


Val 


As: 






• Phe Glr 
280 


. Leu Ala 


i ^e — 


A.sr. 


_r. 




* a_ 



3 0 5 
Pro 



Arg Leu His Val Lys 
2 5 : 

A-= Val Val Val Ser 
31C 



Glr. Ar- Lvs Asp Pro 
255 " * 3C: 

Thr Pro Asp Cys Thr Tyr 



Glv Pro Lys Pro 



Tyr Leu Val Tyr 
3 2 0 



Thr Pro Ala lie Tyr Arg Leu Val Met Cys Met Ala Val Ala 
32 5 330 
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As r> Cvs He Giy Kis Ser Cys Ser Gly Leu His Pre Cys Ala Asr. Phe 
34C 34d 

v-c G'u Th- Pro Arc Leu Leu Ala Ala Thr Leu Ser Arc 
beU 353 H " ~ ' 360 3cr 

T-. e Arc Tvr Ala Pro Lys Asp Arg Arc Ala Ala Met Lys Gly Asr. Leu 
— VI: * 375 380 

Glr. Arc Tyr Ala Ala Thr As? Ala Arg Thr Leu Gly 

3i*5 " 

Ser Ser Thr Val Ser Asp Met Leu Glu Pro Thr Lys His Val Ser Leu 

„. , e _ , vs «hr He Phe Asn Tr.r Asr. Me- Val lie Asr. Tr.r 

G-U ~S.. rT.tr w\S — 43 p 



Lys lie Ser 



4 5 C 



* s Val Pro Asn Thr Leu Gin Lys Thr He Leu Asr. 

440 * 45 

as- As- Phe val lie Arg Lys Tyr Ser Val Lys 

4=5 46C 



s — Thr He Ser Val Phe Phe Ser Asp Asn Me: Cys Glr. 

b " 470 475 48C 

Gly Thr Ala lie Asr. He Asn lie Ser Gly Asp Me: Leu His Phe Leu 
Phe Ala Me- Giy Thr Leu Lys Cys Phe Leu Pro He Arg His He Phe 

r- n r\ ^ n ^ 3 W 



50 



r>— va 1 S— Ala Asn Trp Asn Ser Thr Leu Asp Leu His Gly Leu 

' Sll ' 520 525 

n c - r--- "v- Me: Val Arc Met Gly Arg Lys Asr. Val Phe Trp Thr 

535 S40 

* c - p«» t> — ser Val Val Ser Ser Lys Asp Gly Leu Asn Val Ser 

t>«. Lvs Ala Ala Tr.r Ala Thr He Ser Lys Val Tyr Giy Gin Pro 

- 565 570 373 

r i- *i e A-c His Glu Leu Ala Pre He Leu Thr Asp Glr. 
.eu — G.» 5e _ 590 

as- Giv Asn Lvs Asr. Axg He Phe Ser Leu Leu Glu 
H_s — ebo 605 



• Gin He Gin Thr Leu His Lys Arc Phe Leu Glu 

~" r " 615 6=C 



His Arc As: 
61C 

- s Leu Val Glu Cys Cys Ser Phe Leu Arc Leu Asp Val Ala Cys He 
£ l - 6 3 0 6 J = 

- -. =, *- p t"a Arc Glv Leu Phe Aso Phe Ser Lys Lys He He 
Arc Arc ~-a ^.-a A*a a- 3 - £35 

645 € => c 

£ _ — lvs Ser Lys His Glu Cys Ala Val Leu Gly Tyr Lys Lys 

£ " 660 ™S S - 

Cys Asn Leu He Pro Lys He Tyr Ala Arg Asn Lys Lys Thr Arg Leu 



Asp 



675 

Glu Leu Giy Arg Asn Ala Asn Phe He Ser Phe Val Ala 
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690 695 700 

G-v "-s Ar= Phe Ala Ala Leu Lys Pre Gin He Val Arg His Ala lie 
70= " 71C 71 = 

Ar= Lvs Leu Gly Leu His Trp Arc Kis Arg Thr Ala Ala Ser Asr. Glu 
725 730 • — 

t~ ?ro pro Ala Asp Pro Arg Val Arg Cys Val Arg Pre Leu Val 
74 C 74 

:: information for SE2 id no:22: 

= i SEQUENCE CHARACTERISTICS : 

(A! LENGTH : 364 base pairs 



vE 



TYPE : nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



MOLECULE TYPE: DNA (genomic) 

(ill- HYP OTKET I CAL : N 

j;v; ANT! - SENSE : N 

i ix ■ FEATURE : 

(A) NAME /KEY: CDS 
(3; LOCATION : I . .3 64 
(D) OTHER INFORMATION: 

ixi SEDuEKCE DESCRIPTION: SEQ ID NO: 22: 

. C"A ACC GAG GCC GAG GTT AAG AAA TCC CTG AG 2 AGG CTT 

m«- Val Arg Pre Thr Glu Ala Glu Val Lys Lys Ser Leu Ser Arg Leu 

' 5 10 1 - 

"A G" G"A CGI AAA AGA GCA GGT AAC CGG GCC CAC CTG GC2 AC 2 TAC 

Pre Ala Ala Arc: Lys Arg Ala Gly Asn Arg Ala His Leu A^a Tr.r Tyr 

20 * 25 3C 

■ — C AAG TAC TCC ACC CTG CCC GAT CTA TGG CGG TTT CTA 

:ZZ JZ- - « u Leu L vs Tvr Ser Thr Leu Pro Asp Leu Trp Arg ?ne Leu 

, c— CAG AAC CCT CCC CTT GGA CA2 CAC AGA TTA TTC TTT 

A— Pro Gin Asn Pro Pro Leu Gly His His Arg Leu Phe Pne 

~ GGG CAC AGA ATT GCC GAC TGC GTA ATT CTG GTA TCG 

G~ • Va 7 Tr.r Leu Glv His Arg He Ala As? Cys Val He Leu Va. Ser 

i= 7C 75 80 



SG" GG- "A" CAG CCC GTA TGT TAC GTT GTA GAG CT2 AAG ACT TGT CTG 

3":. g"" - s G~- Pro Val Cvs Tvr Val Val Glu Leu Lys Tr.r Cys „eu 

AG" "A" CAG CTG ATC CCA ACC AAC ACC GTG AGA ACG TCA CAG CGA GCT 

sir hTs Sin Leu He Pro Thr Asn Thr Val Arg Thr Ser Gin Arg ~a 



100 



105 HO 



CAA GG2 CTG TGC CAA CTC TCC GAC TCG A 
G* - G'v Leu Cvs Gin Leu Ser Asp Ser 
* 115 * -2 0 



4c 



96 



14- 



24 0 



28 = 



336 



36^ 
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(2: INFORMATION- FOR SEC 10 NO: 23 : 

{-) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 ar.ir.o acids 
(3J TYPE : ammo a:id 
(D? TOPOLOGY : linear 

iii! MOLECULE TYPE: protein 

ir.i' SEQUENCE DES CRIPTION : SEZ IC NO:22: 

Met val Arg Pre Thr Glu Ala Glu Val Lys Lys Ser Leu Ser Arg Leu 

r>-~ "a i" a A-= Lvs Ar= Ala Gly Asr. Arg Ala His Leu Ala Tr.r Tyr 



25 " 3C 

A-= Ar= Leu Leu Lys Tyr Ser Thr Leu Pro Asp Leu Trp Arc ?he Leu 

35 40 HS 

co^ co- a— Gin Asr. Pro Pro Leu Gly His His Arg Leu Phe Phe 

G -. u va" Tr.r Leu Glv His Arg He Ala As? Cys Val He Leu Val Ser 
I. * ' 70 75 

Glv Glv h:s Gir. Pre Val Cys Tyr Val Val Glu Leu Lys Thr Cys Leu 
6 5 ^ ^ 

Ser His Gir. Leu He Pro Thr Asr. Thr Val Arg Thr Ser Gin Arg Ala 
ioc 105 - 1D 

--. - G'v Leu Cvs Gir. Leu Ser Asp Ser 
115 ' 120 

;i: INFORMATION FOR SEQ IO NO: 24: 

S E QUSN CE CHARA CTER ISTICS : 
iA- LENGTH: 91 e base pairs 
■ - ■ TYPE: nucleic acid 
(I.- S7RANDEDNESS : single 
;r: 7 OP 2 LOGV : linear 

M2LECULE TYPE : DNA (genomic) 

i v Aim - SENSE : N 

;v FEAT~Jr.E : 

;A • NAME / Fit V : COS 

{ B : LOCATION: 1. .916 

{2.' 2 THE?. INFORMATION: 



i>:i S 



EQUEKCE DESCRIPTION: SEQ ID NO: 24 



4E 

cTT AA'_ - - - ^~ - - 

_ _. _ , - „ . • ^'na 

Leu Asp Lys Ser He Val 



, GA*~ AAG ACT ATA GTG GTT AAC TTC AC 2 TC2 AGA C._ 

o -- Vss - Val as" Phe Tr.r Ser Arg Leu Pne 

1C 



... G ~- G — CTT CAG TCA AAA ATA GGG AGC GTA CTG CCG 55 

Ila IZi G~ Leu" Ai; Ha Leu Gin Ser Lys He Gly Ser Val Leu Pro 
2C " 
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GGi GAT TGC CAC CGT TTA CAA AAT ATA CAG GCA TTG GGG CT3 GGG 144 

^ Cvs H^s Arc Leu Gin Asr. He Gin Ala Leu G.y «-V 

2*5 40 4r 

— ™ -g A CGT GAG ACA TCT CCG GAG TAG ATG CAA ATT AT 3 "AG IS! 

Ya ~v= Ser Arc Glu Thr Ser Pro Asp Tyr lie Gin He Met G_r. 

5 : * " 55 so 

„ ACA CTG GCT GTG TTG GAG GAG GTT CGG GGG GAT 24: 

ifl! *~1" L ,. c Cyc Thr Le u Ala Val Leu Glu Glu Val Arc ?r: Asp 

- - 7C 75 e: 

— ZZZ TTA ACG CGG ATG GAT ZZZ TTT GAG AAG GTT CAG ATA AAA ~£ = 

el- Arc Leu Tr.r Arc Me: As? Pre Ser Asp Asr. Leu G-r. --e -ys 
E5 90 5= 

. .„ ■ G -~ ccc TTT CAG TGG GAG AG Z AAG AGG GAG GTA GCA 2 3£ 

Va" Ala Pre Phe Phe Glr. Trp Asp Ser Asr. Tr.r Glr. Leu Ala 

-_r Z 105 H~ 

G _„ ~CA TTT TTT AGG CGA AAG GAT TGG AGG ATT GTG GTG GAA 3 64 

v&l Leu Pre Pre Phe Phe Ser Arc Lys Asp Ser Tr.r lie Val Leu G^u 

- - = 'ZZ 12 r 



c - GAT GAG ATG AGG AGG ATG CGG GTG ATG TTG TGG 

LI" "I: val pro Asd Asp He Thr Arc Met Arc Val Me. Pr.e Ser .yr 
" 2£C " 265 

c - r AGT TGT ATA TTT AAT GTT GGG ZZZ A3A GTG GAG GTG 

£eu Hr! ?ir Leu Ser Ser He Pre Asr Leu Gly Pro Arg Leu His ^a. 
275 280 2Bd 

^ G GGA GAG AGT TTG GGG GGG TGG TGT TGG TAT TGG CCA 

A r a Tv : Ser Ala Glu Tr.r Leu Ala Ala Ser Cys Trp Tyr Ser Pro 

**" n 9: * 25= 300 



480 



528 



576 



AA~ G3^ ™TT GAG GGG GTG TTG CGG ATG GTG GTG CGG GAG GAA CH- 

S- : As : G'v Phe Aso Pro Val Phe Pro Me: Val Val Pro Gin Gur. Leu 
13C . " * 125 140 

GGG "A" G~- ATT CTG CAG CAG CTG TTG GTG TAG CAG ATG TAG TCG AAA 

v a' a 'I* 1 Leu Glr. Gin Leu Leu Val Tyr His He Tyr Ser Lys 

Hi "" ' ISO 155 ISO 

— - G ~~ GGG GGG CGG GAT GAT GTA AAT ATG GCG GAA CTT GAT GTA 

- : 2 ^1- Ala Glv Ala Pro Asp Asp Val Asr. Me: Ala Glu Leu Asp Leu 
* 1€5 170 175 

A — GT3 TCA TTT ATG GGG CGG ACA TAT CGT CTG GAG GTA 

-11 ~ZZ As - val Ser Phe Me: Glv Arg Thr Tyr Arc Leu Asp Val 
*' 18 * C 165 190 

3-~ AAG AGG GAT CCA CGT ACT GCC CTG CGA GTG CTT GAG GAT CTG TGG £24 

As - as: Pro Arc Thr Ala Leu Arg Val Leu Asp Asp Leu Ser 

" " 155 * 200 205 

~ A — CTA TCA GGG TTG GTT CGG AGG GGG TGT CTG CGT 6 71 

If;! ~~.~s He Leu Ser Ala Leu Val Pro Arg Gly Cys Leu Arc 

ii: ~"~ * n= 220 

G - G CTr GTG CGG CAG GAG AGG CAT CGT CTG AGA GAG- GTG 72 C 

L- : ~-u T-r Ala Leu Val Arc His Asp Arc His Pro Leu Tr.r Giu ya_ 

113 23 C 

-TT GAG- GGG GTG GTG CCA GAT GAG GTG AGG AGG ATA GAT CTG GAG GAG 
Phe Glu Glv Val Val Pro As? Glu Val Tr.r Arc He As? Leu ^ 
245 2 5 0 ,- = " 



76E 



El£ 



664 



912 
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22S 



CGI 
Arg 
3 05 



INFORMATION FOR S£Z NO: 25: 

ill SEQUENCE CHARACTERISTICS: _ 
iA.- LENGTH: 3 05 atr.ino ao-as 
(H; TYPE: arr.ir.c acid 
tc' TOPOLOGY: linear 

— ■ KOLEC-JLE TYPE : prooeir. 

(xi SEOCEKCE DESCRIPTION: SE; ID NO:2E: 
- - S — Val Val As- Phe Thr Ser Arc Leu Phe 

••- As- ~-"u Leu Ai» Ai» Leu Glr. Ser Lys He Gly Ser Val Leu Pre 

v-s r-c Leu Gin Asr. lie Gin Als Leu G.y Leu Gly 

_ _ 4 u n - 

- - c*- Ser Pro Asp Tyr He Glr. He Me- Glr. 
cys va. Cys Ser A_ £0 

SO 

ser Lvs Cys Thr Leu Ala Val Leu Glu Glu Val Arc Pro Asp 

~£5 ' 70 

_ _ , _.. L ...- Tnr Aro Me: As? Pro Ser Asp Asr. Leu Glr. lie Lys 

Se- i-e = - g , - 90 S3 

~-~ Ala *ro Phe Phe Glr. Trp Asp Ser Asr. Thr Glr. Leu Ala 

^S.. .... 

.... r>h- Phe Ser Arg Lys Asp Ser Thr He Val Leu Glu 

s- Glv Phe Asp Pro Val Phe Pro Me: Val Val Pro Glr. Glr. Leu 

13 — Y^ Y ~ ~" " ~ ' "35 j. 4 G 

Glv H,s Ala lie Leu Glr. Glr. Leu Leu Val Tyr K,s lie Tyr S.r Lys 

- - = 15C --^ 

Ser Ala Gly Ala Pro Asp Asp Val Asr: Met Ala Glu Leu Asp Leu 

1£5 - 7D 

-v- _ r-:.. "■■r— Leu Asr Val 
-v.- Tnr Asr. Val Ser Pne Me: G-> ~r- * .• - ~- r 

" is: 165 

As- As- -Sir Aso Pro Arg T.-.r Ala Leu Arg Val Leu Asp Asr Leu Ser 
*s. ^_ - 20C 

Ke-- Tyr leu Cys lie Leu Ser Ala Leu Val Pre Arg Gly Cys Leu Arg 

Leu Leu Thr XI. Leu val Arg H,s As? Arg H.s Pro Leu Thr Glu Val 

- - - 23 0 

Phe Glu Gly val val Pro Asp Glu Val Thr Arg He Asp Leu Asp Glr. 

245 - Sl 
Le u Ser Val Pro Asp As ? lie Thr Arg Me, Arg Val «e: Pne Ser Tyr 
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Leu G" ^ Ser Leu Ser Ser He Phe Asn Leu Gly pro Arc Leu His Val 
z75 280 285 

Tvr Ala T%-r Ser Ala Glu Thr Leu Ala Ala Ser Cys Trp Tyr Ser Pre 
2 90 295 30 0 

Arc 

3 05 

C; INFORMATION FOR SEO. ID NO :2c: 

!i SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 8 72 base pairs 
(5 1 TYPE: nucleic acic 
{C: STRANDEDNESS : Single 
[D\ TOPOLOGY: linear 

iii: MOLECULE TYPE: DNA (genomic J 

i i i i KYP CTHET I C AL : N 

(IV: Aim -SENSE: N 

( :x FEATURE : 

(A; NAME /KEY: CDS 
(3) LOCATION : 1 . . 873 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG GCG TCA TCT GAT ATT CTG TCG GTT GCA AGG ACG GAT GAC GGC TCC 4 8 

M< L- Ala Ser Ser Asp lie Leu Ser Val Ala Arg Thr Asp Asp Gly Ser 
! 5 10 IS 

GTC TCT GAA GTC TCC CTG CGT GGA GGT AGG AAA AAA ACT AC- GTw ±Aw 96 
Va* Cv = Glu Val Ser Leu Arc Gly Glv Arg Lys Lvs Thr Thr Val Tyr 
2C 25 30 

CTG CCG GAC ACT GAA CCC TGG GTG GTA GAG ACC GAC GCC ATC AAA GAC 14 4 

Leu P~c Asp Thr Glu Pro Trp Val Val Glu Thr Aso Ala He Lys Asp 
3= 4 0 "45 

GCC TTC CTC AGC GAC GGG ATC GTG GAT ATG GOT CGA AAG CTT CAT CGT 152 
A" 5 r>^c ser Asp Glv lie Val Asd Met Ala Arc Lvs Leu His Arc 

"=*C " 55 60 



G — *■" A— a ^~eu 
z = 



CCC TCA AAT TCT CAC AAC GGC TTG AGG ATG GTG --T 24 0 

o r : Ser Asr. Ser Kis Asr. Glv Leu Arg Met Val Leu Phe 
7- '7- 8 0 



. '.J * o - 



TA"~ TGT TAC TTG CAA AAT TGT GTG TAC d A G — w - ^ 
X v ; ~..l Cvs Tvr Leu Gi- as- Cys Val Tvr Leu Ala Leu Phe Leu Cys 
" ~ ~ ~ " ~ * ~ 8 5 9 0 5 = 



r 



— g — G*"G GCA CCT GAG GTG CTC TTC CCA CAC CCG GCT GAG ATC TCT 

P ;~ va'* Va 1 Ala Pro Glu Val Leu Phe Pro His Pro Ala Glu Me: Ser 
11= 120 12= 

CGC GGT TGC GAT GAC GCG ATT TTC TGT AAA CTG CCC TAT ACC GTG CCT 

A-= Glv Cvs Asd Asd Ala lie Phe Cys Lys Leu Pre Tyr T.ir Va. Pre 
- 135 14: 



28 E 



CC2 CTT AAT CCT TAC TTG GTA ACT CCC TCA AGC ATT GAC- TTT GC2 GAG 3 3c 

P— - — ■ Asr. Pro Tvr Leu Val Thr Pre Ser Ser lie Glu Pr.e A* a G:u 
IOC ' 105 



38-; 



432 
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-~A A — AA~ AC" ACG TTT GGA CGC ATT TAC CCG AAC TCT AC A CGC GAG 

lie Asr. Thr Thr Phe Gly Arg He Tyr Pro Asr. Ser Thr Arc Giu 

: 45 150 155 loC 

— GA- GGC AGG CCT ACG GAT TAC TCC ATG GCC CTT AGA AGG GCT TTT 
Asr Giv Arc Pro Thr Asp Tyr Ser Met Ala Leu Arc Arg A.* Pr.e 
1£5 170 1*5 

G~A G-T ATG GTT AAC ACG TCA TGT GCA GGA GTG ACA TTG TGC CCC GGA 
A r a Va- val Asr. Tr.r Ser Cvs Ala Gly Val Thr Leu Cys Arg G.y 

IBC 185 190 

GAA A"" ~A~- ACC GCA TCC CGT AAC CAC ACT GAG TGG GAA A." CTG CTC- 
~ - .. 1-T T - r A i a ser Arc Asr. His Thr Giu Tr? Giu Asr. Leu Leu 

19*5 " 200 205 

GCT ATG TTT TCT GTG ATT ATC TAT GCC TTA GAT CAC AAC TGT CAC CCG 
"~a M°" P^» Va* lie lie Tvr Ala Leu Aso His Asr. Cys His Pre 

211 H= " 22Z 

GAA G~- — G TCT ATI GCG AGC GGC ATC TTT GAC GAG CGT GAC TAT GGA 
3* a" a L°u Ser lie Aia Ser Glv lie Phe Ass Giu Arg Asr Tyr Giy 
z ll ' 220 135 2-iC 

^ A _ r Trr CAG ccr C3G AGG GTG CCC TCG CCT ACC CCT TGC GAC 

Phe lie Ser Gir. Pre Arg Ser Vai Pro Ser Pro Thr Pro Cys Asp 
245 2S0 255 

GTG TCG TGG GAA GAT ATC TAC AAC GGG ACT TAC CTA GCT CGG CCT GGA 
Va" S«- Trr Giu Aso lie Tvr Asn Gly Thr Tyr Leu Aia Arg Pro Gly 
" 26C * 265 170 

AAC TGT GAC CCC TGG CCC AAT CTA TCC ACC CCT CCC TTG ATT CTA AAT 
Asr Cvs Aso Pro Trp Pro Asr. Leu Ser Thr Pro Pro Leu lie Leu Asr. 
2 75 2B0 26 5 

TTT r-u-i-r-. » rtA 

Pr.e Lvs 

25 C 

:: ikfc-rkatidi: ro?. sec ic no:27: 

.1 SEC/JENCE CHARACTERISTICS: 

;A; LENGTH: 292 amino acids 
{3! TYPE . amno acid 
(2; TOPOLOGY: linear 

I::- MOLECULE TYPE: protein 

SE WHENCE DESCRIPTION: SEC. ID KO :Z~: 

mo- l*a Se~ Ser Aso lie Leu Ser Val Ala Arg Thr Asp Asp Gly Ser 

* 5 10 1 = 

Va* "vs G*u Val Ser Leu Arg Gly Gly Arg Lys Lys Thr Thr Val Tyr 
"" 2 2 2 5 2Z ' 

p— as- Thr Giu Pro Tro Vai Val Giu Thr Asp Ala lie Lys Asp 
25 40 45 

Ala Phe Leu Ser Aso Gly lie Val Asp Met Ala Arg Lys Leu His Arg 
5C 55 6C 

Cv Aia L~u Pro Ser Asr. Ser His Asr. Gly Leu Arg Met Val Leu Phe 

i'5 " 10 75 B0 



46 : 



525 



766 



816 



664 



875 
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~ vs -vs Tv- Leu Gin Asr. Cys Val Tyr Leu Ala Leu Phe Leu Cys 

o-c Leu Asn Pro Tvr Leu Val Thr Pre Ser Ser lie Giu Phe Ala Giu 

100 1°5 

P-o Val Val Ala Pre Giu Val Leu Phe Pre His Pre Ala Glu Me: Ser 
115 -20 -~ = 

G*v Cvs Asr Aso Ala lie Phe Cys Lys Leu Pre Tyr Thr Val ?r: 
* 13C ' * 135 1«C 

lie lie Asr. Thr Thr Phe Giy Are lie Tyr Pre Asr. Ser Thr Are 



150 



1 = Z, ^.D w 



^ Z Asr Glv Are Pre Thr Asr Tyr Ser Me: Ala Leu Are Are A ; a Phe 

165 17C 

Ala Val Met Val As- Thr Ser Cys Ala Giy Val Thr Leu Cys Are Giy 

i e : 155 - 9 v ' 

qi . . G -.- T - r Ser Are Asn Hi's Thr Giu Trp Giu Asr. Leu Leu 

A*- a Phe Ser Val lie lie Tyr Aia Leu Asp His Asr. Cys His Pre 

21: 215 220 

A* a L*»u Se- lie Aia Ser Giy lie Phe Asp Glu Are Asp Tyr Giy 

225 * " 23C 235 240 

~ pno -> s-r Gin Pro Arg Ser Val Pro Ser Pro Thr Pro Cys Asp 

245 250 255 

Va - S o~- — ^ G'u Asd lie Tvr Asn Glv Thr Tyr Leu Aia Arg Pre Giy 

260 " * 265 27C 

, £ ^ -^rp Pro Asr. Leu Ser Thr Pro Pro Leu He Leu Asr. 

~ " ._r ' 280 2E5 



P h s L v s 



I N r 3RKAT ION* "OH S E 2 ID NO : 2 6 : 

;i SE0UZ NCE CHARACTERISTICS : 

(A- LENGTH : 36 3 base pairs 
< 3 > TYPE : nucleic acid 
;r; STRAKDEONZSS : single 
:2 ' T0PCL03Y : linear 

■in MCLECVLE TYPE : DNA 1 genorr.i z '■ 

iivi Aim - SENSE : 1: 

1 IX i FEATUP-E : 

;A> NAME /KEY: CDS 
(5) LOCATION: 1 . .363 
!D) CTHER INFORMATION : 



ixi; SEQUENCE DESCRIPTION: SEC. ID NO: 26: 

. — ; A*~3 -»t.- GTC TCC ACT CAC CGG AGG AAT GGT GGA CG3 

Mel Ser Me: Tnr Phe Pre Val Ser Ser His Are Arg Asr. Giy Giy Are 



4 £ 
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G GT G CG AAT GGC CAC CAA GCC TCC C« - G«- . - - 

r;- r:; ^Z; Glv Ala Asn Gly His Glr. Ala Ser Arg Asp .rp Ser 

" r 2C 25 J ~ 

. . C rT AG 7 CAT AGG CGC CTG CGT CTA CTG CTG CAT 

^a L-l Pr= Pro Ser His Arg Arg Leu Arg Leu Leu -eu .-.is 

~ 5 " r_ 4C 45 

, . GG" GGC TCG ATT GTG GCG CGT CAC CCC ATT AGC- CAG 

sZi Arg Va_ ?;= Gly Gly Ser Thr Val Ala Arg His ?r: Thr Arg G_n 



5: 55 6C 



GGC CAT CGT GGC GTJ 



GGT CCT TCG CAC CCT GGG ATC GCA GGC CGG 



Glv His Arg G?y val sir Gly Pre Ser His Pro Gly Thr Ala Gly Arg 

-~- a" GCC GAC GGT GGC CAT AGC TAC CCA GGA GCC CTA TCG 

."I" «. w .; —a Aso Glv Gly His Ser Tyr Pre Giy A^a ^eu ?r: 



e5 

- - A G«3 AG A TTA GAA CGG GGT GTG TGC TAT AAT GGA TGG 

-?Z ~1 - : c « ia -a Ar= Leu Glu Arg Gly Val Cys Tyr Asn S-y rp 

* * " "~ - — " 10= — : 



ic-: 

^ T3C- GGG GGG GCT GTA GAT AAT TGA 
L-- Trr Glv Gly Ala Val Asp Asn 
115 120 



INFORMATI ON* FDR SZO. ID NO : 2 9 



' i SZDUZNCE CHARACTERISTICS : 

(A) LENGTH: 120 ammo acids 
CE) TYPE: ammo acid 
ID j TOPOLOGY: linear 

ii KCLECJLE TYPE: protein 

*>:i SZCYENCZ DESCRIPTION : SEC ID NO : 2 9 : 

M -- c — v— Thr Phe Pro Val Ser Ser His Arg Arg Asn Gly Gly Arg 
' 5 -° * r 

-o-. - Ala Asr. Glv His Glr. Ala Ser Arg Asp Trp Ser Tyr 

: ' : ' 25 30 



As- a" a Leu Pro Pro Ser His Arg Arg Leu Arg Leu Leu Leu His 
~ :5 40 45 

^ Aro Val ?ro Gly Gly Ser Thr Val Ala Arg His Pre Tr.r Arg Gin 
\ \ ' 55 6 c 

Gly His Arg Gly Val Ser Gly Pro Ser His Pr; Gly Thr Ala Gly Arg 

— „ ~ r - Ala Aso Glv Gly His Ser Tyr ?r= Gly Ala Leu Pre 

65 " 50 5 = 

Tyr Asn :le His Ala Arg Leu Glu Arg Gly Val Cys Tyr Asn Gly Trp 



10C 

Leu Trp Gly Gly Ala Val As? Asn 

;i; INFORMATION FOR SEC ZD NO: 30: 
;i SEQUENCE CHARACTERISTICS 



ZBc 



3e5 
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(A) LENGTH : 521 Dase pairs 

(B; TYPE : nucleic a=ic 

(C) STRAKDEDNESS : single 

CD'' TOPOLOGY: linear 

TYPE : ONA ( genomi c ) 



iiili KYPOTHETICAL 



: iv ; 



-SENSE: N 



- v FEATURE : 

(A = NAME / KEY : CDS 

LOCATION : 1- .921 
■r? CTHEr. INFORMATION: 

.Xi. SEQUENCE DESCRIPTION: SEO ID NO: 30: 

- ( AG*" CGT CAC AGG GAG CGC CTT GCC GCC AAC CTG GAG GAG 

Z i£ 2er Arg His Arg Glu Arg Leu Ala Ala Asr. Leu G.u Giu 



g— GGA GAG AGG TGG GAA CT3 ACT GCC COG ACA TTC 

ZZZ Hi Z^s Isi Ala Gly Glu Arg Trp Glu Leu Ser Ala Pre Tar Pne 



GA - TTC CAA CAG CAG CTA GTA ATG CAC AC. TT 

val 52s Trp ^ & Pne «» Sir. £. v.: Me, H„ Tnr Pne 



165 



, - — t+ g G — CCA CAA AAG ACG CAC CAG TTG ... 

55 Til. Zz v;i s« =i; & & ?« K1 , m ,~ «. 



192 



24 C 



2 86 



25 

-3- CCC AAA ACG GCA CGG ATG GCG CAC CCT TTT ATT GGC 

^2 cys Pro Lvs Thr Ala Arg Met Ala His Pro Pne 

_ tt r- T-t * T *A'" AGT TCG GTC CTG GAA ACA TAC TGC 

V T? ?I? ^ Arg lie" ^ Ser T^r Ser Ser Val Leu G.u Thr Tyr Cys 

, r — C— GCC ACG CCC ACG TCA GCA AAT CCC GAC GTG GGA 

?Ts Sis Pro Ala Thr Pre Thr Ser Ala Asr. Pro Asp Va. o:y 

"ii "~ 70 /=> 

. „. — G AG GA" AAC GTC CCC GCA AAG CCG CGC CTA TTG 

HI ~C Vrl 'sir Glu As?" Asn val Pro Ala Lys Pro Arg Leu Leu 

E5 90 

, _~ - ~- T — TTG CAG ATG CGG TGT GTG CGC GAG GAC GCG 

Glu Ser Leu Ser Thr Tyr Leu Gin Met Arg Cys Val Arg Glu As? A.a 
' '. 10 z - °- 

GCC GAT CAA CTG GTC GAG TAC CAG GCG GGC AGA AAA 

»?: Ya 7 Ala Asp Gin Leu Val Glu Tyr G.r. A.a G.y Arg s 

115 120 

„ G g— TGC TCT GTC TAC CCC GAA CTT CAG GOT " 3 ~ 

?S kT; ^ iir Ou Ali Cy S S.r V.1 Tyr Arg =-u Leu Gir. A,. 

__ HI „_ .-- -. -r TTT CTG AAC GCC TCT TAC GTT CCC GGG 48C 

L4u vli ^ Lei Ser Phe Leu Asr. Gly Cys Tyr V.l Pre ~-y 

ln= 150 



3B-; 



57c 



is: 165 
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K 3 S S - « S i S 5 K £ 1 - - - 

155 200 



-3 CAA AA2- GCA AGC GTA TTC GTA ATA CCA AGG AGA 

:;r- a- ?te ~s Gin Lys Ala Ser val Phe Leu He Pre = 

s si s: s ?s s s ss a s s s s s s s 



215 



23C 



AA- A"" GGG TAG GTA GGG CAO CAA AAG . 

GT " ~: ^ - : : Giv Tyr Val Ala His Gin Lys His Va. 

. TTC GGG GAA ATG ATA AAG AGG GTT TGT GGG TGG TTI 

-'a Ts~ sir v=I ?he Ala Giu lie lie Lys Tr.r Leu Cys Arg Trp Pne 
26" 2c = 

„ --^ AA A AAG ATG AAG AAG GAG AAG GGA AGG ATA ATG TAG 

Vrc Pzl ^ Lk Ash He Lys Lys Glu Asr. Gly T^ He 

: r. 2 B 0 - 

„„, GS - -GG TGG AGG TGG GTG ATG TGG GGA AGA TGG TTC 

?^ ;« =• v ;« — «« ^ ^ Ala rhr ys 

is: 

AAT AAG AAG 
Asr. Lys As- 

;i INFORMATION FDR SZQ ID NO : 3 1 : 

' - S E G'JEN G £ CHARACTER ISTICS : 

(A: LENGTH: 307 amino acids 
12; TYPE: air.mo arid 
;G TOPOLOGY: linear 

MOLECULE TYPE: protein 

ixi SEQUENCE DESCRIPTION: SEC/ ID NO : 3 1 : 

Me: Leu U u Ser Arg His Arg Glu Arg Leu Ala Ala Asr. Leu Glu Giu 



5 



Tr.r Ala Lys Asp Ala Gly Giu Arg Trp Giu Leu Ser Ala Pre Tr.r Pne 
-, r Ar5 KiS :v S Pre Lys Thr Ala ATS Me, Ala K t . Pro Phe lie «V 

- r. 4 C 

Val V»: Kis Ar= lie Asr. Ser Tyr Ser Ser Val Leu Glu Tr.r Tyr Cys 

— A-= HU H,s Pro Aia T.-.r Pro Tr.r Ser Ala -sr. Pr= Asp Val Giy 

p~ *-e Pr= Ser Giu Asp Asr. val Pr= Aia Lys Pr= Arg Leu Leu 

Gl u Ser Leu Ser Thr Tyr Leu Glr. Ar S Cys Val Arg Giu Asp Ala 

ioc -°- 
„ i8 Vai Ser Tr.r Aia Asp Sir Leu Vai =lu Tyr Gin Aia Gly Arg Lys 



B6«i 



512 



922 



115 i2D 
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235 



Thr His Asp Ser Leu His Ala Cys Ser Val Tyr Arg Glu Leu Gir. Ala 

oh. Leu val Asr. Leu Ser Ser Phe Leu Asr. Gly Cys Tyr Val Pre Gly 

:v= i5c -== 

Va- — Trp Leu Glu Pre Phe Gin Gir. Gin Leu Val Met His Thr Phe 
163 - 70 

Pne P,e Leu Val Ser He Lys Ala Pro Gin Lys Thr His Gin Leu Phe 
I8C 1E - 

Leu ?.e Lvs Gir. Tyr Phe Gly Leu Phe Glu Thr Pr = Asr. Ser Val 



; = : 



— Phe Lvs Gin Lvs Ala Ser Val Phe Leu He Pre Arg Arg 
**" " 115 HO 

His Glv Lys Thr Trp He Val Val Ala He He Ser Me, Leu Leu A ; a 
"25 " -30 

s — Ya Glu Asn lie Asr. He Gly Tyr Val Ala His Gin Lys His Val 
245 

A £ As - cer val Phe Ala Glu He He Lys Thr Leu Cys Arg Trp Pne 
26C 2o£ 

Lvs Asr . Leu Asn He Lys Lys Glu Asn Gly Thr He He Tyr 
z 7 z 280 285 

-~ ?-e Glv Glv Arg Ser Ser Ser Leu Met Cys Ala Thr Cys Phe 

' " 295 300 



is:- 

Asn Lys Asn 

3 ZS 



INFOP-MATION FOP SEQ 10 NO: 32: 

;i SEQUENCE CHARACTERISTICS: 

(A'. LENGTH: 13 6 5 base pairs 
(5) TYPE: nucleic acid 
<C STRANDEDNESS : single 
;C. TOP0L0GV : linear 

■ -_ MOLECULE TYPE: DNA (genomic) 

ill HYPOTHETICAL: N 

■iiv AI"- SENSE: N 

ix FEATURE : 

(A- NAME / KEY : COS 

(B> LOCATION: 1. .I3£S 

(O" OTHER INFORMATION: 

ixi- SEQUENCE DESCRIPTIOi: : SEQ 10 NO : 3 2 



•-3 g;~ G~~ CA" GOT ATO AAO GAA AG A TAC GTA GOT OCT CSC TOO CAC 
M*t Asp 111 Ala He Asn Glu Arg Tyr Val Gly Pre Arg Cys .is 

g-G GTG CTG COT AGO AC 2 TTT CTG CT3 CAT CAC GOO 

l;» sr; 5:? 5:i ^ P r= ^ ^.-.r ^ ^ ^. Ki « ^. 



9€ 



2C 



ATA COO CTG GAG COO GAG AT 2 ATI - -T TOO AC 2 TA_ A__ wo-j- 
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23c 



lie Pre 



Leu Glu Pre Glu lie 



CGG 
Arr 



Leu 
£5 



Ser 
5: 

CCA 
Pre 



35 



Pro 



GGG 



GGG 

Glv 



GAG 
Glu 



TCA 

Ser 



GAA 



AAG 
AST. 

7 0 



CGC 
Arg 

55 

CAA 
Gin 



He 

40 

CGG 
Arg 



Leu 



Phe Ser Thr Tvr Thr Arg Phe 
45 



TTG GTG GTG TGT GGG AAA C 

:%* 

6 



Leu Val Val Cys Gly Lys Arg 



AG g 

Ser 



lie Ser 



Leu Pre Le 



65 



Phe Ser His 



1 1 



G7A 
Val 



Leu 
IOC 



CGC 
Arc 



AAT 
Asr. 



GCC 
Ala 
145 



AGA 

Arg 



AAT 
Asr. 

13 : 

GTA 
Val 



Pr.e Leu Tyr 



CGG 
Arg 



AG 7 
Ser 



ATT 
lie 



Leu 



cgg 

Pro 



Ser 



Ser 



TGC 

Cvs 



CTG 
Leu 
120 



TAG 
His 



Pro 



Pre Aso Pre Ala Leu 
IS5 



GGG 
Arg 



GGG 
Gly 
IB C 



GAG 

Glu 



GGT GGG GGA 
Ala Pre Gly 
155 



Ser 



AGG 
Tr.r 



Leu 
150 

AGG 
Thr 



GAG 
AS? 



j-i. — 

Thr 
135 

GAG 

Glu 



GGT 
Arg 

GGA 

Gly 



Val 



GAA 
Giu 



GGA 
Gly 

GAG 
Asd 



GGG TGT TGA GGT TGT GGT TTG 

Ala Ser Ser Pre Ser Gly Leu 

75 

GAT GGG AAG TTT GAT GGA TTT 

Aso Glv Asr. Phe His Pre Phe 

* 3C 55 

CGT GGT TGT AAT GTT AGT CTT 

Pro Glv Ser Asr. Leu Ser Leu 

105 11C 

GTG GTG GGT ATG GGG GGG GGA 

Val Val Ala Mel Glv Ala Gly 
125 

GAG GGG GTA TGG CGG- GGA GAG 

Asp Giv Val Ser Pro Pro Glu 
140 

CTG CAG AGG CTG GGG CGT GGT 

Leu Glr. Arg Leu Ala Arg Ala 
155 

CGG TTG CAG GTG GTG- AGG GGG 

Pre Leu Glr. Val Leu Thr Giy 

170 1"5 

CGG GGG AGT CAG CAG ATG GGG 

Arc Ala Thr His His Me: Ala 

185 19 0 



Ser 

GTG 

Val 

GGG 
Ala 

s: 

GAG 
Asp 

AGT 



GGG 
Arg 



GGG 
Giy 



AGG 
Thr 

ISC 



Leu 



GAG AAG GGG GGA GGG 



Lvs 



Pre 



AGG 

Ser 
— - 



Phe Asr. Pre Val 



Thr Asr. 



GTG 

Val 



- yr 



GGG 
Glv 



GAG 



Arg 



:~5 



Glr. 
2cC 

GTG 
Val 



Glu 



Phe He Asr. Th: 



~T AAG AGG 



ATG 

29: 



TGG 
Ser 
245 

GAA 
Glu 



CGG 
Pre 



AAG 

Lvs 



GTG 
Val 



GGG 
Arg 



AAT 
Asr. 
23C 

GGT 

Gly 



Arg 



AGA 
Thr 
215 

GGG 
Ala 



GGA 

Giy 
2 00 

CGG 
Arg 



GAT 
Asp 



GGT CGG 
A_a Pre 



GAG 
Asp 



GGG 
Pre 



CAG 
Glr. 



Asp 



Pre 



TGG 
Cys 
255 



TGG 
Trp 



CGG 
Pre 
280 

GAG 
Asp 



GAA AGG CTA GAG CGG CZZ 
Giu Ser Leu Aso Pre Pro Val 
205 

CAG AGG CCA CCG CGG GTG CGA 
His Arc Pro Pro Pre Val Arg 
220 

GTA CGG GGT AGG TGG GGA GAG 
Val Pro Ala Tr.r Trp Arg Asp 
235 

TAG TAT GTG TGT GTT TAG GAA 
Tvr Tvr Val Cys Val Tyr Glu 
' 250 255 

Leu Pro lie Pre Leu Ser Phe 
265 ! — 

GGG TTA GTG TTG ATG GAG GAG 

Glv Leu Val Phe Met Asp Asp 
2E5 

TTT GTG GAG ACG CTA GAG GGG 

Phe Val Asr Thr Leu Giu Ala 
3 00 



Ser 



Leu 



GGG 
Ala 

2-;o 

CGG 
Arg 



GGA 

Pro 



3S4 



4s; 



52E 



57£ 



£"2 
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55 gj s is m i s 2 s ss s s s ?i; s f 

SSSSSgSBSSgSSSSgS 

- G — . — TA 333 333 =TS GCT 7=3 SAC- GCT ACT 3" T33 ATA. 

Ala Cvi Leu ?il Leu Arg 31y Le, Ala S.r 31u A. a ,« ~. -e 

„ — r~~- CTT GGC CGC CAC GGC TGG TGG ATG GAG 

^ Ilk Ala ==r Ser ^ Gly Ar= H,s Ala Cys Try Me-. Asp 

- 3 - —A T33 GAA A3C 333 CCC CAC ACT CTA GGC TT3 SAC- TTA 

val LeG G?y LeG Try 31, Ser Arg Pr= H,s T.,r Leu =.y -e, 
37C 3 ' = 

— — 3 - GGC GGC ACG GAG GGT GAG TGG TTA GAG ATT TTA 

Arc ~Iy ?•£ ^ Cys Gly Gly Tfcr Asp Gly As ? Try Leu 3--- -,e ^ 
323 390 

m m s s i a 5: s si e m ss s si? a s; 
5? 5?; si: s s a s; ssss s s s s 



420 



s — CGC TAT AGG GCG TGG AAG GAG GAT CTG GTG TTC 

Phi Ala zie 2£ Xla Lg Tyr Arg Ala Ser Lys Glu Asp Le, Va. Pne 



435 440 



ATT CGA GGC TGG TAT GGC TAG 
lie Ar = Gly Arg Tyr Gly 
4s: 



124E 



125c 



1344 



I3c = 



:z i!;fg?j-*at:ok fop. seg. ic ko:33: 

SrCC/ENCE CHARACTERISTICS: 

(A LENGTH: 4 54 ammo acids 
(E^ TYPE: amino acid 
{C- TOPOLOGY : linear 
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,eu Ser Leu Pre Leu Phe Ser His Asp Gly As. Phe His Pre Phe Asp 

B5 9L 

Zie Ser Val Leu Arg lie Ser Cys Pro Gly Ser As, Leu Ser Leu Thr 

ICC 103 

va" Ar= Phe Leu Tyr Leu Ser Leu Val Val Ala Me-. Giy Ala Gly Arg 

115 — 0 

, sr . hsr . Ala Arg Ser Pro Thr Val Asp Gly Val Ser Pro Pro Glu Giy 
i3C - 3 = 

H« val Ala ?r= Leu Glu Giu Leu Gin Arg Leu Ala Arg Ala Thr 

- . - 15 0 - - b 

^ ** r 

A' a Leu Thr Arg Gly Pro Leu Gir. Val Leu Thr Gly Leu 
— * _ 5 170 

Arc Ala Gly Ser Asp Gly Asp Arg Ala Thr His K.s Mez Ala Leu 

-*■-. -"a Glv Thr Val Arg Gly Glu Ser Leu As? Pre Pro Val Ser 

dr. Lvs Gly Pre Ala Arg Thr Arg His Arg Pro Pro Pro Val Arg Leu 

lie ri= 

Ser Phe Asn Pro Val Asr. Ala Asp Val Pro Ala Thr Trp Arg Asp Ala 

125 ~ 30 

Thr As, val Tyr Ser Gly Ala Pro Tyr Tyr Val Cys Val Tyr Glu Arg 

Glv Glv Arc Gin Glu Asp Asp Trp Leu Pro lie Pro Leu Ser Phe Pro 

Glu Glu Pro Val Pro Pre Pro Pre Gly Leu Val Phe Me: Asp Asp Leu 

Phe He Asr. Thr Lys Gin Cys Asp Phe Val Asp Thr Leu Glu Ala Ala 

1 5 C *~ 3 

-v» Aro Thr Gl, Giy Tyr Thr Leu Arg Gin Arg Val Pro Val Ala lie 
3 Or " 310 

=ro Aro AS, Ala Glu lie Ala Asp Ala Val Lys Ser H,s Phe Leu Giu 

22: - 30 
Ala Cys Leu Val Leu Arg Giy Leu Ala Ser Giu Ala Ser Ala Trp lie 

A- A' a Ala Thr Ser Pro Pro Leu Giy Arg Hu Ala Cys Tr? Me". Asp 
* 355 360 

Val Leu Gly Leu Trp Glu Ser Arg Pro His Thr Leu Gly Leu Glu Leu 

Ar= Glv val AS, Cys Giy Gly Thr Asp Gly Asp Trp Leu Giu lie Leu 
3El ' 39 = 3S " 

Lvs Gin Pro Asp Val Glr. Lys Thr Val Ser Giy Ser Leu Val A.a Cys 

405 4 ~ J 

val lie val Thr Pro Ala Leu Glu Ala Trp Leu Val Leu Pro Gly Gly 

420 4 ^ 
Phe Ala lie Lys Ala Arg Tyr Arg Ala Ser Lys Glu Asp Leu Val Phe 
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Arg Gly Arg Tyr Gly 

INFORMATION FOR SEO ID NO: 34: 

SEOUENCE CHARACTERISTICS: 
tA. LENGTH: 964 base pairs 
( b • TYPE: nuclei:: a;id 
;:■ STRANDEDNESS : single 
(0 TOPOLOGY: linear 

-1 KCLECYLE TYPE: DNA (genomic; 

KVFCTHETI CAL : N 

.iv Aim -SENSE: N 

[ ix FEATURE : 

{ A 1 NAME / KEY : CDS 

IB LOCATION: 1..9B4 

;C OTHER INFORMATION: 



23 9 

44 0 44 5 



ixi. SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AGC TCG CTC GTG TCC GAG GGT GAC CCG GAG GTG ACC 

P'ne Ala Leu Ser Ser Leu Val Ser Glu Gly Asp Pro Glu Val Tnr 
5 ID 

■ .„ - ~ ,3— AAG GGO GTA CAA CTT GCC CTG GAC CTT AGC GAG AAC 

7H va r Lvs Glv Val Gin Leu Ala Leu Asp Leu Ser Glu Asn 



20 



AAG TTG ATA GAA ACT CCC CTG AAC AGC 



3-3 — CTT CGG GAG GOT GTA GAT GGC CTG TGT GAC CCC 

£n ~s A« Xla Leu Leu Arg Glu Ala Val Asp Gly Leu Cys Asp ,ro 



.3 C 



48 



96 



144 



. „ T 33- *~AA "™ A^LU - * O (J^WW rt- i W _ .1 w • - 

-:C -v Gin Phe Lvs Leu lie Glu Thr Pro Leu Asn Ser Phe Leu 

3= • 40 45 

— G G — TCI AAC GTG ATG CCC GAG GTC CAG CCA AT0 TGC ACT GGO CGG 

-"" as- Va" Me: Pro Glu Val Gin Pro lie Cys Ser «iy Arg 

- e - V J: — " 5= 6C 

- - 3 - CCA GAC TTT AGT AAT CTC CAC TTG COT AGA CTG GAG 

Vrc III Leu Arg Pro Asp Phe Ser Asn Leu His Leu Pro Arg Leu Giu 

. 3— CTC GGG CAG GGT TT0 GGG GCG GCG GGT GAG GAA 

Leu Sl7. L~ Va! Leu Gly Gin Gly Phe Gly Ala Ala Gly G.u Giu 
"** 6 5 9 0 " - 

CAC "A SAA ACA CA= 3 AA AA3 GGC :A3 GTG 23€ 

x:; Liu x» ier b» v .i =iu «*■ KiS l ys «>■ =i, v., 

ico 10:1 " w 

GCT ACC GAC- GAG TGI- ACG TGI- GOT TTG ACT CTG 3B4 

pi: 52s Tv; Ala Thr Glu Glu Trp Thr Trp Ala Leu Tnr .eu 

*** 115 " 12C 



191 



:4C 



2B £ 



431 



if* S £ in- S S - E ~= - ™ a 1 

- - - 150 -- = 



14 = 
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~- G ~ G GGA CZZ GZZ TGT TTT TGT CGG GCI GAG TGT TGI CTG 

~" £eu Phe Asn Giy Pro Ala Ser Phe Cys Arc Ala Asp Cys Cys x,eu 



Leu 



1£5 



170 175 



"A" AA" GAG CAI TGC GGT TAG CGG GGG CCG GTG GTA C- - «- f . — - 

~.. s Glr u-c cvs Glv Tvr Pro Giy Pro Val Leu Leu Pre «.y r.:s 
* 180 " 1E5 190 

— . -A' G~ CCI AAA CGG GAT GTT TTG TCG TTI GTT AAT CAT GGG CTG 

P;;: f v ; Pr - L vs Arg Asp Leu Leu Ser Phe Val Asn His A.a Leu 

* " * 1 ? 5 * 2 00 2 0 5 

TTT GTA TAG GGA GAT TTT TCI GGG ACA TGG GGG GGG 

- Lvs Phe Leu Tvr Giy Asp Pr.e Ser «.y .r.r .... —a ~-<= 

::; *"" - -L= 220 

c — TTI GGT ACT TGT GGG ATA. CAA AGG GTA GTG AGT 

- 7 k ^-s = »rs Pre Phe Ala Thr Ser Arg lie Gin Arg Val Val Ser 

^3 ^ 23C 235 240 

— AAA ATT ATA GAT GGT TGC GAI AGT TAT ATT TGC GAG ACI TGI 
mo- ---s --e He Asr Ala Ser Asp Tr.r Tyr He Ser His Tr.r .ys 

°— * 245 * =50 =£5 



76 = 



£16 



B64 



S6C 



984 



, - TG - CAG ATA TAT GAG CAA AAT AGG ATA ATT GGG GGT GAG GGG 

£ e " L«* n ys Kis He Tvr Glr. Gin Asn Ser He He Ala Giy Gin Giy 
260 * 265 270 

-A" GTG GGT GGA ATG GTA CTG TTG AGT GGA AAA GGG AGG GAG TAT 
«-s va" Glv GIv He Leu Leu Leu Ser Giy Lys Giy Tr.r Gin Tyr 
-75 ' * 280 285 

-~A A~A GG~ AAT GTT CAG AGG CAA AGG TGT CGA AGT AGG GGG GAG TAT 912 
-"2 -H G -. v As - val Gin Thr Gin Arg Cys Pro Tr.r Tr.r Giy Asp Tyr 
290 * 295 300 

, A — A — "CA TCG TAT GAG ATA CCG GGG ATI ATI ACT ATG ATG AAG 

71" l - : I Ser Tvr As- He Pre Ala He He Thr Me: He Lys 

305 310 ' 315 320 

GAG AAT GGA CTG AAG CAA CTG TAA 

Glu Asn Glv Leu Asn Gin Leu 
325 

::;fgrkat:o:: tz?. seg id ng:35: 
i seiczkie cha=actzrist:gs : 

(A; LEKGTH : 3 2" air.ir.s a rids 
(3 ' TV? I : a -mo arid 
[Z ■ TGPGLGGV : linear 

KCLECCLE TYPE: prc^em 

;k* SEQUENCE GESCP.IPTIGIC: SEI IT- KG: 25: 

*h- Ala Leu Ser Ser Leu Val Ser Glu Giy Asp Pre Glu Val Thr 
. 5 1C -= 

Ser Ar= Tvr Val Lys Giy Val Gin Leu Ala Leu Asp Leu Ser Glu Asn 
20 25 J - 

p „ G -, v Glr . phe Lvs Leu He Glu Thr Pro Leu Asn Ser Pine Leu 
35 ' 40 45 

Leu Val Ser Asn Val Met Pre Glu Val Gin Pre He Cys Ser Giy Arg 



5! 
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r>-~ a" a Leu Arr Pro Aso Phe Ser Asn Leu His Leu Pre; Arg Leu Glu 
"e= " " ?c 7= e: 

Lv« Leu Glr. Ar~ Val Leu Gly GIr. Gly Phe Gly Ala Ala Gly Glu Glu 
- £5 9= 53 

A* a Leu Asr Pre Ser His Val Glu Thr His Glu Lys Gly Gin Val 
10C- 105 11C 

Phe Tvr As- His Tyr Ala Thr Glu Glu Trp Thr Trp Ala Leu Tr.r Leu 
115 HO 125 

Lvs Asr Ala Leu Leu Arg Glu Ala Val Asp Gly Leu Cys Asp Pre 
13 ; 13 5 "l4 ": 

G*v Thr Trr Lvs Giv Leu Leu Pre Asp Asr? Pro Leu Pre Leu Leu Trp 
— 5 " 15: 155 16: 

L»u Leu Pr.e Asr. Glv Pre Ala Ser Phe Cys Arg Ala Asp Cys Cys Leu 
165 17C 175 

~v- Lvs Gl- His Cvs Gl-.* Tvr Pre Giv Pre Val Leu Leu Pre Gly His 
is; * ies is:- 

M»- tv- Ala Pre Lvs Ar= Aso Leu Leu Ser Phe Val Asr. His Ala Leu 
155 " 200 205 

Lvs Tvr Thr Lvs Phe Leu Tat Gly Asp Phe Ser Glv Thr Trp- Ala Ala 
li: * 215 220 

Ala Cvs Ar= Pre Pro Phe Ala Thr Ser Arg lie GIr. Arg Val Val Ser 
225 ' ' 230 235 24C 

Gl- Me: Lvs lie lie Asp Ala Ser Asp Thr Tyr lie Ser His Thr Cys 
245 250 * 255 

L*u Leu Cvs His lie Tvr Gin Gin Asr. Ser lie lie Ala Gly GIr. Gly 
250 " 255 27C 

His Val Gl-.- Glv lie Leu Leu Leu Ser Glv Lvs Gly Thr GIr. Tyr 
2'- ' ' 280 2e5 

He Thr Glv Asr. Val Glr. Thr Gin Arg Cvs Pro Thr Thr Gly Asp Tyr 

" 2BZ 2 95 3 0 0 

Leu lie He Pre Ser Tyr Asp lie Pro Ala lie He Thr Me: lie Lys 

305 310 315 32C 

Glu Asr. Glv Leu Asr. Glr. Leu 
325 

;i' ::;-opj<at:di: for sec zz no:3£: 

.1 SZCL"TNCZ CHARACTERISTICS: 

;A' LENGTH: 3 3 0 base pairs 

TYPE: nuclei: acid 
(C: STRANDEDNZSS : single 
:C ; TOPOLOGY : linear 
In- KOLECVLE TYPE: OKA (genome: 

liii* HYPOTHETICAL: K 

(;v! A!":- SENSE: K 

Ixi- SEOUENCE DESCRIPTION: SEC. II N0:3e: 
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ggat 



AGTGGGACAG 




^cccag CT 

%r GG — ttgaCC CCGTGTTCCC CATGGTCGTG CCGCAGCAAC TGGGGCACGC 
TATT CTG GAG CAGCTGTTGG TGTACCACAT 
TGATGTAAAT ATGGCGGAAC TTGATCTATA TAG 
ATATCGTCT3 G A CGTAGAC A ACACGGATCC 



ATATC33CCG GGGCCCCGGA 
.T GTGTCATTTA TGGGGCGCAC 



6C 

ISC 
240 

23: 



;z; information for se~ i- no: 37: 
:- segue nce characteristics : 

;A LENGTH: 62" base pairs 

{=■ TYPE: nucleic acid 

■Z: STRANOEDNESS : single 

(G. TOPOLOGY: linear 

i:i. KCLECGLE TYPE: DNA (genomic) 

in: HYPOTHETICAL: N 

i;v; Aim - SENSE : N 

txi: SEQUENCE DESCRIPTION: SEQ. ID NO: 37: 
GGATCCGCTG GCAGGTGGGC GCGGACGTGG TCGGGTAGGT TGGAGAGAAA 
CCA3TCC3CG CCGTAGCGCC TGGAGGTGGC TGAGCACCGG GGGGGGGTGA 
TTA3TCCGGA GAAGATAGGG GGGTTGGGAA GCCGCTGAAC CAGCTCCAGG 
T3C3CACC3G TTGTCGGAGC TGTCG OGATA GAGGTTAGGG TAGGTGTCCG 
GGGCTCAAAC CTGCCCAGAC ACAO: 

g: 

GC3ATGGTGC GOAOGGTTTT TAAGAAAOCO CGGAGGGTGG GGACTOCOGC 
A-GTOGGGGT GCTGTACGTC CTTGGGGAAT ATGGGAOGAA ATCGGCTGTG 
OOGAGGGGGG GTOOGGTGGC ATACAGGCCG GTGAGGGGCO GOTGGGTOTG 
AACAGGGTGC TGTGAAAGAA CAGGTTGOAA GGCCGCGAAT ACCCCTCTG Z 
GGAGGTGGGT GTATGCTCCG TGGATCC 




CAGGTGGAGG 
TGCGATGTGT 
GTCTCCAAGA 
GTGGGTGGGT 
TGAGGGAGAT 
AGGAGTGGTG 



CGCACGGGGT 
TGCGGGTGGA 
AGGGTGGTGT 



information for seg. 



NO : 3 e 



■;i SECUENOE CHARA.CTER.ISTICS : 

;A LENGTH: 233 base pairs 
(Hi TYPE: nucleic acic 
( z ■ STRANDEDNESS : single 
;o: TOPOLOGY: linear 

!:i? MOLECULE TYPE : DKA (genomic) 

(ill) HYPOTHETICAL: N 

!:•:! Aim -SENSE: N 

t>:i, SEDUZNCE DESCRIPTION: SEQ ID Iw : 3 e 



60 
120 
1BO 
240 
300 
360 
42 0 
4 8 C 
540 
6 00 
€2" 
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AG C CGAAAGG ATTCCACCAT TGTGCTCGAA TCCAACGGAT TTGACCCCGT GTTCCCCAT3 c'Z 

GTCGTGCCGC AGCAACT33G GCACGCTATT CTGCAG CAGC TGTTGGTGTA CCACATCTAC -ZZ 

-^j^j;--- CGGCCGGGGC CCCGGATGAT GTAAATATGG CGGAACTTGA TCTATATAC. - = - 

ACCAATGTGT CATTTATGGG G CG C A CAT AT CGTCTGGACG T AG A C AA C A C GGA -3: 



information for sec 



NO : 3 & : 



sequence characteristics: 

(A> LENGTH: 32E base pairs 
: h TYPE: nuclei- acic 
•:c STRANDEDITESS : single 
:PCLOGY: Linear 



iii KCLir-"-: TYPE: DNA (genomic) 
i i i i HV?CTH£T I CAL : N 
; :v AT"! -SENSE: K 

SEQUENCE DESCRIPTION: SEQ ID NO : 3 S : 
GAAATTAC C C ACGAGATCGC TTCCCTGCAC ACCGCACTTG GCTACTCATC 
CCGGCCCACG TGGCCGCCAT AACTACAGAC ATGGGAGTAC ATTGTCAGGA CCTCTTTATG 12 0 
ATTTTCCCAG GGGACGCGTA TCAGGACCGC CAGCTGCATG ACTATAT CAA AATGAAAGCG 
CCGGCTCACC GGGAAACAGA ATGGATCACG TGGGATACAC TGCTGGGGTT 

3 — G-G AGAACCTGCC CGGTTTGAGT CATGGTCAGC TGGCAACCTG CG AG AT AATT 

CCCACGCCGG TCACATCTGA CGTTGCCT 



6 0 



18 0 
240 
300 
32E 



INFORMATION FOR SEQ ID NO: 40: 

:-. SEQUENCE CHARACTERISTICS: 

(A' LENGTH: 132 base pairs 
(£■ TYPE:- nucleic acid 
;r STRAIOEDNESS : single 
ID- TOPOLOGY: linear 

in": MOLECULE TVPE: DNA t genomic) 
KY? C THET I CAL : N 
AirTI - SENSE : N 
<>:i' SEQUENCE DESCRIPTION: SEC ID NO : 4 C : 

G-GCAGGAGT GACATTGTGC CGCGGAGAAA CTCAGACCGC ATOCCGTAAO 

CACACTGACT GGGAAAATCT GCTGOCTATG TTTTCTGTGA TTATCTATG0 CTTAGATCAO 

io: 

AACTGTCAC0 CC- 

INFORKATION FOR SEQ ID NO : 4 1 : 

f=J SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 0 base pairs 
(E; TVPE : nucleic acic 
(C! STRANDEDNESS : single 



6C 



WO 96/06159 



PCT/1?S95/101 



244 

(D) TOPOLOGY: linear 
(ii'i MOLECULE TYPE: DNA (genomic) 
iiil) HYPOTHETICAL: N 
(iv; ANTI -SENSE: N 

ixi - SEQUENCE DESCRIPTION: SEQ CD ND:4i: 
AGCCGAAAGC- ATT CC AC CAT TCCGTGTTGT CTACGTCCAG 

INFORMATION FOR SE2 ID NO: 41: 

\i SEQUENCE CHARACTERISTICS: 
(A- LENGTH : 3S base pairs 
(=: TYPE: nucleic acid 
! C ■ STRANDEDNESS : single 
[Z TOPOLOGY: linear 

;ii) MCLECULE TYPE: DNA (genomic) 

: i i i ' HYP CTHET I CAL : N 

(:v ; ANTI -SENSE: N 

txi!' SEQUENCE DESCRIPTION: SEQ ID NO: 42 
G AAATT AC CC ACGAGATCGC AGGCAACGTC AGATGTGA 

;;, INFORMATION FOR SEQ ID NO:43: 
{ — SEQUENCE CHARACTERISTICS : 



iA 



LENGTH: 46 base pairs 
TYPE : nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



t>:i 



KOLECULE TYPE : DNA (genomic) 
HY ? CTHET I CAL : N 
ANTI -SENSE: N 

SEQUENCE DESCRIPTION: SEQ ID NO: 43 



3B 



:gtcat gtgcaggagt GACCGGGTGA CAGTTGTGAT CTAAGG 4 6 



[!• INFORMATION FOR SEQ ID NO: 44: 

; i ■ S E QUEN CE CHARACTER ISTICS : 
(A. LENGTH: 2 1 base pairs 
(Hi TYPE: nucleic acid 
;C? STRANDEDNESS: single 
(D> TOPOLOGY: linear 

; i i ; K 0 LE CULE TY? E : DNA ( g e n Dir.: c > 

■; i i i ) HYP CTHET I CAL : N 

!:v! ANTI -SENSE: N 

ixiJ SEQUENCE DESCRIPTION: SEQ ID NO: 44 



WO 96/06159 



PCT/VS95'10W 



ACAGGGTTGG TTGCCCAGGS 7 

ikedp-katiok fop. szz: nd:45: 



SEDYENTE CHARACTERISTICS: 
( A LENGTH: 2C base pairs 
{ h ■ TYPE: nuclei- a::: 
( c '■ S TP.ANDED1JZS S : single 
(I- TOPOLOGY: linear 

MOLECULE TYPE: DNA t genome • 



;:v AN*TI- SENSE: K 

ixi SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 
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whs- is claimed is: 

1. Ar. isolated DNA molecule which is a: _easc - - 
nucleotides ir. length and uniquely defines a 
herpesvirus associated with Kaposi ' s - sarcoma . 

2. The isolated DNA molecule of claim 1, wherein tne 
isolated DNA molecule is cDNA . 



isolated DNA molecule of claim 1, wherein tne 
ated DNA molecule is genomic DNA. 



ISO- 



An isolated RNA molecule which is derived from 
the isolated nucleic acid molecule c: o^aim 

The isolated DNA molecule of claim 1 wmcn is 
labelled with a detectable marker. 

The isolated DNA molecule of claim 3, wherein the 
marker is a radioactive label, or a caiorimetric , 
a luminescent, or a fluorescent marker. 

A reclicabie vector comprising tne isolated DNA 
mtlecule of claim 1. 

A clasmid, cosmid, A phage or YAT containing at 
least a portion of the isolated DNA molecule of 
claim 1 . 



An isolated herpesvirus associated with Kaposi's 
sarcoma . 
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A nucleic acic mciecuis cz «- -s»s- 
r.uciectides capable cf specifically r.ycnci c tr.g 
with -he isolated DNA molecule cf c T &-rr. 1 . 

A DNA molecule cf claim 12. 



- c 



2 0 



nucleotides capable cf spe cic i ca__;. 
with a nucleic acid molecule which 
complementary :c "he isolated DKA molecule c: 
c _ a i m _ . 

A nucleic acid molecule cf claim 15 wherein the 
nucleic acid molecule is capable c: nyoncizmg 
with moderate stringency *o at least a portion cz 
a nucleotide sequence as shown m Figure 3A (S5Q 
II NO : 1) . 

An isolated oeotide encoded by* at least a portion 
cf a nucleic acid molecule with a sequence as set 
forth in ID NOs : 1-27", . 

A host cell which expresses the peptide cz ciairr. 
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The antibody of claim 21, wh 
monoclonal antibody. 



ereir. "ne anticccy 



The antibody of claim 21, wherein tne ar.ticccy !=■ 
a polyclonal antibody. 

The antibody of claim 21, wherein the antioccy is 
labelled with a detectable marker. 



The labelled antibody of claim; 24, wherein t 
marker is a radioactive label, cr a ca-orimetri 
a luminescent, cr a fluorescent marker . 



2i . Ar. ant i sense molecule capable c: nycri citing tc 
-he isolated DNA molecule of claim 1 . 

2 - The antisense molecule of claim 2c, wherein the 
molecule is a DNA. 

2 z The antisense molecule of claim 26, wherein the 
molecule is a RNA . 

crioiex oligonucleotide capaoie oz nyorioitmg 
with a double stranded isolated DNA mclecu_e cr 
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from a turner lesior. of zhe subject; 
ccr.:ar-:ng -he nucleic acid molecule with : 
labelled nucleic acid molecule cf claim i:- un: 
hybridizing conditions; and to: ne ce rm.tr. ir.g 
presence cf the nucleic acid .mole :u. e - z - 

Kaoosi ' s s arc orris . 



The method c: c^aim -2 wneretr. -r.- 
frcrr. the tumor lesion is amplified before step 

(b) . 



r-. 



method of diagnosing Kaposi's sarcoma wnicn 
comprises: (a) obtaining a nucleic acid molecule 
from a suitable bodily fluid cf a subject; lb; 
contacting the nucleic acid molecule wicr. tne 
labelled nucleic acid molecule of claim. 13 under 
hybridising conditions; and iz" determining the 
cresence cf the nucleic arid molecule 

v...'-,.*- - — - t a -* "he oresence oz wnicn is moica — ve 
cf Kaposi's sarcoma m tne subject, thereby 
diagnosing Kaposi's sarcoma. 

p. method of diagnosing a DK.-. virus associates 
with Kaposi's sarcoma which comprises la) 
obtaining a suitable bodily fluid sampie zrom a 

thereto a Kaposi's sarcoma antioooy, s: as 

bind Kaoosi's sarcoma antioooy to a spec 

Kaoosi's sarcoma antigen, -c removing unoouno 



bodiiy 



i"""om tne s u c c c _ — , — 
i level or Kane si s o^r— Omo 



._ fc ;-^«^ v ho 1 — 1 bv the Kaoosi's sarco.ua antigen 
therebv diagnosing Kaposi's sarcoma. 
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3£ . A me -hod cf diagnosing a DNA virus ass:: 
with Kaposi's sarcoma which comprises 
obtaining a suitable bodily fluid sample f: 
subject, (b) contacting the suitable bodily 
cf the subject to a supper" having a_reacy - 
thereto a Kaposi's sarcoma antigen, so as :: 
Kaposi's sarcoma antigen to a sps::::: Kap 
sarcoma antibody, (c) removing unocunc r: 
fluid from the support, and id. deierminm: 
level cf the Kaoosi's sarcoma antigen ecu: 
the Kaposi's sarcoma antibody, thereby diagn: 
Kaoosi ' s sarcoma . 



— a - — — 



A method cf treating a subject wicr. Kaposi s 
sarcoma, comprising administering :: the subject 
an effective amount cf an antiser.se molecuie c: 
claim 26 under conditions such that the antisense 
molecule selectively enters a turner cell c: tne 
subject, so as to treat the subject. 

A method for treating a subject with Kaposi's 
sarcoma ( KS ) comprising administering tc the 
sub'-ec: having a human herpesvirus - associates: KS 
a pharmaceucieally effective amount cf an 
antiviral agent in a pharmaceutical!;.* acceptable 
wherein the acient is effective tc treat 
the subject with KS -associated human herpes virus 
c f claim 12 . 



A method c: propny.axis or treatme: 
sarcoma ( KS ' by administer mr tt 
ris>: for KB, an antibody that bind 
he roes virus cf ciaim 12 m a pr 
acceptable carrier. 



4 C . A method cf vaccinating a subject a-amst 
Kaoosi's sarcoma, comprising administering tc ^n— 



pcm : s95/iow 
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subject an effective amount cz — 
claim 1", and a 

cherebv vaccinazins tne surjs-- . 



d a suitable a=cep:ar.e carrie: 



A method of immunizing a sub-ect. against a 
disease caused by the herpesvirus associate- vitr. 
Kaoosi's sarcoma which comprises administering cc 

A method for preventing the development cr 
cransmission cf herpesvirus associated Kaposi's 
sarcoma m a subject by treating a sucject witn 
Kaposi's sarcoma (KS) comprising administering to 
che subject, having a human herpesvirus - associated 
K£ a pharmaceutical!}' effective amount of an 
antiviral agent in a pharmaceut i caliy acceptable 
carrier, wherein the agent is effective to 
ore venting the development or transmission c_ ^ne 
v c - associated human herpes virus or c-aim . 
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FIGURE 2A 
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FIGURE 2B 
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FIGURE 3A-1 

SEQ. ID. NO. 1 

tcgagtggga gagttgggag AGGtr— gag gtggttgtga zztzzzz^zz zzzttz-ztzz = . 

~-- r3A 7 — CA3AATTACG CA3TCGGGAA T7GTG7G GGZ 7AGAG7CA" T~AAAGAA7A 1 5 : 

A7GTGTGGTG TGGAAGGGGA GGGTTGTGGG GCCSC" TT TA3AAAGCGG CATAGATGGG 2*: 

GCAGG-3TGGA G7A77T GAGG AGGG3GCGG7 AGG7GOGCAG G7GGGGCC3G TTAr77G7Tr 3" 

777TGCG7GC TGGTG-GAAGC 7TGTTCA3GG A AAG CTCGGGCTC3 GTTGGAGGTA 3£- 

CCA7GGCAGA AGGGG377TT GGAGCGGAGT GGG7GGGGCG CGGC3GAGAA AAGGG77GTG n Z Z 

TGAG7AGGG-G ASGCAGGTGG GACTTGGGGA GTTGGGAGGA CGAATCAAGC AGGTrrACAA 4BZ 

CGAGCAGGGA 7ATGGACGAC GTCCGTGAG-G AGAG-GAAACT ACTAACGGGA AAG777G7AA 54 : 

AAACC7GGTA CAT ATA Z G A Z GTGCCCAGGG TCGGGACGAG CAAGCGG7GG CA77T AATG Z ZZZ 

AZGAGAAGTr =— TACGGA ACGCCTAGG7 TTGGGGCCAG AG~GT=ATA CGGGAGCGTT HZ 

Z Z G AAAAAG-G CAGCATTTT7 GCCAGTGGGT TGTZAGGGAG 7GAGGACGAG 7GGGGAGAC7 ~ZZ 

. r -, -iSGG- GT3~G3T~GC ZZTZZZZZTZ ~S "- 

ACGCGGCAA. ^v^--c ti ° o . o - 

, ... -3-3--AATG GGGGAGG7G3 B ■; C 

GGGGTGGAAA TGAG ~ --^ - ^ - - - aiJ ^ 

GGT3-GGGGGA 7G7G GAGG-GA CTGAAGAGGA GGGGAAAGGG A AAAA A_A. _ « 

AGGGG-GGGAG TTTGAAAGGG GGTGGAGGGG ATGTAGGT3A rCGTCTGASG GAZGGGGGGT 5S : 

„„_ A3 -rrGTAGOGGG GTGAAATGTG GGA7ASGGCA AAAGATTAAA 7GATG-GTTGG ZZZZ 

GGG7AA7GGG 737GGG3AAA TGAAGGGTGG 7GAACGGCGT GTGGGGGATG TTGCGGGAGG ZZZZ 
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FIGURE 3A-2 




: aag: 



:ZZ Z 3A3CC3A73- 737AC73GAC GA33G:A77T A:AGA..3.- 
"CTS A73AAGTCTG 37AAGGCGG3 A3ACrC3C73 a:37C73CCA 

. . GA33AATG - 7 Ar~3r7A73- 

— -A-A-T - ATGCCAAAAC AA3T --- " 

-3:=^ =c,=crrr=, ^ttmcw ct^== — — — 

;„ TT3AT , 3 — — 

TA ~_-- AT rT— A3A33C A3A3AAG3CG 

lllll AGC 3 "3A37C3T7 33333333" A33O33A30O 

A33T3-3T33C 3A 

"AA3AAAGAA 33A3333A33 GTGGAGCAAA ACT A CATC A3 A-.- — 

I: 3 ==37=7* rs:^ atktsttk 

. AC3A3C A3 AG CATS CT ACC7ATGA" ACC33737AC 

^3GA33AAAC . 

7GA3AAAA7T ACAA7TTA7C 3TA^-^-^ 

_ -~ AAACCC A 33 3 3 CAT CA 

—-^-^A AAT CTACA3-3 CA3A7CCTG7 CCAA- - — — 

" ^.^^-^ rCTA3A3GA3 ACAT3CAGG-3 

--TAGrCTTC 7T3GCGGCC7 TT3CAT3CT3 333A73CA7A 7C37TGACA7 

3A-AA" AACCC3CT3C 3~CAC3CA3C 7CA73J 

GAGAAICAAr C7C73CA7A3 AAC7^~~- - 

^A-CT3TT3 AA7373A7CA C33A3CC3GC CCT3ACAGA3 77: 

. . ! -;AACAA3A7 

-3CC3A33AC CTCA33G7AA 37 77 3 AAAAA GAG 3 C AAA j - - " ~ 




:C 7AAA7733CA 



iz~: 
zzz: 
;-5: 
I-;-;: 
: = 

is.: 
i6b: 

isc: 

186C 

is: : 
:96 : 
20* : 

lie 

:r s 
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FIGURE 3A-3 



A37T37GA7C 7"GGA3A::G GCGA7337TA 
7 7 AT AA I ATI ArCAAGGG" 7TAA77A7AG 

— rAA"r:- cstctgstaz tsggtgatat 

A773A773-G3 AAGAAC3AG7 rZSGCGTAGC 
GGTC3TCAr3 — GACAAGA 3377::AAAGA 

s^rrzArrsr catctttgaa 

7SGC 

Cr7TA:A3AC- AA7TAGACTC GZATATTTCT 
GAGGCGGACG A773AGA3CA GGAT73TA7C 
AGAG-3GG*GGG CTGGAZATAT 7rTTGG7377 
7GC37SG3-GC A7T37CGAG: ACZGATTTGT 
GGAA77G7A7 TTTTTCCGCC GCATrTCGCG 
G7A7AACCAC -~A~T7T3 GtTGCTG7GGC 



7377A7GAAA 

A3 A7CA3GAGA3 GAAGAAGAG3 

C7A33A7GAG AGCGA73G3C 73CAG77777 

GAACrTCTTT T7AGA7AA37 CA77 

G7GG77GA33 GA7AAG3GGA 3CG7AG7GGA 

TGTTT 73GA7CrT7T A7TAGA33G7 



77G7A7GG GGA7A7A777 



A7TGC737GA CA7A7CAGT7 C3A3CA"" 
T7TGGGG7TA GACA7GA7AA A37A Z 
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FIGURE 3A-4 

733G337T33 GGAA7GG3T7 7 7 77 AAA 3 GG 3AAA777777 A7337A—3A G33A3AA3AT 4 5£ 

3333733737 A3333A3377 73 A37 77773 7AG3A3A3G3 373AA3AA77 3773777-73 - ~~ 

737— 7 73 3 A G-3GG3AAG77 373733737A GA33GA73A3 AAA3A33A3A 33A737777T 46 = 

3A3r"7A73 77GC33A33A AGA737377~ 3AGGAA-A33 7G7737A337 G3777AG777 4 "4 

"AG 777 GG 37 A3AAAAG77 A 7rGA37TG77 AGGA333733 A73A7337AA 33373773AA 4 E ! 

33333337CG 7737773737 7AAG7GAGA3 GA7A7G7GG7 7G3IAAAG77 37333A73 - 4=3 

37 377-7 3J.G7 A7AGA777GG 777GA33GG3 A73G3G7G7T A7GG337GA3 33G37AAA3G 4=z 

3GA3777GG3 73AAA3A33~ 77A7G33G7T GG3GG37737- 733A73A3GA 3A 3 3 7773— 5 34 

733GG3G7G7 A7G3GGAG33 GA3GGGA7G3 3G77GG3AGA 777A7AA777 7AAAG7TGG7 z.Z 

A7AAGAG7GG 73G773G77A 7GG33AG33G GGAG7333G7 AG7A777G3G 737 3 37 33 AA 516 

77737GG333 337A3GA77G G377GGAG73 3AGG7AAA33 33AAGAGA73 3GG77777T3 SZZ 

3337A3G3A3 AA37GG7773 7TAA33GG7A GGGG7G3GG7 GAGAG3A73A 73337A33AA 526 

3GA7A37733 GGG7G37T AG C3G737AGAG 7GG3AGGG7A GA3GAG7333 GA37G73AAA 5 34 

37T773GAA3 AA3AG7GGCA 73GGGA7777 AGGA77AGA3 A77333A33A 7G-333333A3 54 = 

3333GGA3A3 G73AAGA3G7 3 AAA3A3G 33 3733777373 GA3A333G33 3333333377 54 £ 

7A77A3A77A G377T3A337 3 3GGAA77 33 7AA3A7AG77 7AGA33A333 3A3GGA333A 551 

A337A33733 GGA7CGG77G GCGG7G777G 773G7TGGAG GGGG33G777 GG7G3 33GGA 5 5= 

3733AGG377 AG77TG3GAA 7GG3G7GA3G GAGAATTTG7 GG777TAGA3 3GG33AA333 5 6 4 

A7GA37737G GTG-33GAGGA A3GAAA7GAA G7TTGGA7TG 3GG77CAA77 33777A3377 r~: 

337 77-7773 7TT7GG3GA7 AGA77T77GG GA77AGG7TA GA777777A7 A7373AG7A7 = ~ £ 

73333A3733 7G7T7G377T 7AG7G7GA37 GA7TA777T7 7T7GAGAAG7 GAAA3AG3 3 Z StZ 

33GGG7GG3G G773G377AA 733AAG7GA3 G73AAG3773 AGAAA3GAA3 AGCA7773A7 535 

3AGA3A7773 AGGAA3777T 7G7G7AG3G7 77G7A777G3 GAACGG77T7 7G7G773AA3 5 34 

7AGGGAGAA7 A7777A7777 7377773373 GA7G7G3G3G 7337GG7733 73A3AA73-33 i-Z 

7G77G7GG77 77 AAA 3 3 A GG A3A33777AG 333A737773 7733A3A73A 3AGA7GGAAA zZZ 

____^_ r GC 33733— GA3A3A7377 373GGAAAAA AA3A33777A 7GG7AAAAGA 635 

AG37733— A3GGA37A37 77737A7AA7 AAAA773773 G73AA777G3 GGA7G777AA 543 

AA7AG77TTT 7G3AGGG7G7 7A3GAAG373 G3AG777A77 77 A 37 3 77 AA 77A37A7377 6 4 = 

3G7G7TGAA7 A7GG7GA777 73AA37TTTG 3 AAA 77 GA 33 7G77T73733 G7733AG3A7 i 54 
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FIGURE 3A-5 
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FIGURE 3A-6 




GGT 373C3A7C 



z^zzzzzrrz c"aa"ag~ 3aaac3333? 

AG~7A73AG 3AA733-A3A A"""" 



= -e ; 
= = : : 
= z = : 
r : 
= : ; : 
: 

= 11 : 
51= : 
: 

?3 : : 

r3et 

: 

946C 
960: 

56£ : 
5 - r : 
5?E : 

964 : 

95: : 

99£: 

i::s: 
i: : 

ii;e : 

i :3E : 
i : : 
i : 5 : : 
i: 5t : 
: :£ : : 
106= : 
i:~-i : 



WO 96/06159 



PCTVUS95 1019J 



9/48 



FIGURE 3A-7 

TTT73GG~TG AGAGT337GA 3GA3GGA33A GG7T"AGGA 3A.3CAZA.TTr 7A7A373 7~ t iTri: 

3A33333773 A3A773A737 7T373G33T7 337T7333T3 37A7333333 A337AG377: ZZtzZ 

33A33 33-373 A7777T73AAA 7TA733A33A 3A7733T73: 7733A3A33" 3 A 377 G3 77 A 1 : : -1 : 

773A73A3-7 A773333333 333A337G3 3 3333A7AA77 A3A3A3A73" GA37AGAT73 ZZ.ZZ 

77A33A3373 777A73A7T7 7733AGG33A 33337A7CA3 3A33333A3C 733A~3A77A Z.Zi - 

7A7 IAAAA73 AAA3 33-33 33 73GAAA3333 373A73333A AA3A3AA733 A73A337G3" 11 3 3: 

A7A3A373 77 GGGGTTG2T3 GCTTGCGAGAA G™3"r3G7 7T3A377A73 G7GAG2TGG7 

AA337333A3 A7AA77373A 33333373A3 A7373A3377 3337A77T7T A3A73373A3 1-34; 

3AA33733GG GOG 3373 3GG 3373GG7GG7 37337373A7 337TA3A37A AG3AAA333: 114:: 

A3A3 3 377-73 7T37A33A33 ATTCAATAZZ AGACCCCGCG 7A33AA-3__ ^ - -.4 6- 

3AAGGGGTGG GZ7~733GA3C GTSGCTCCGT 3GGCGACGT3 77A7A3AA7A 73A37T7T33 1153: 

3 3 A3 A 373 33 CTGCCGGGCA TGTACAGTZ3 7TG7C3GCAG 7T77T73A3A AS3AAGA3A7 11SBC 



7A733337AZ AA7A3G-3G37 




.is*: 



its: 
is;: 



— -A3 777-73 3 3A7 A 733TG3A3GA GGGCTA7GG3 AG3GTGGGGG GGAGGGAGAG 
A3 7*7 A7 3 377" 3733A37A3A 7373AAA3AA G3A3A3A3AT 3 33 3 3AG7 A Z A3A7GGG33A 

3TA7773A7-7 3AA3AG-3733 33333A73AA GA3A37A7TA AA3 37333AA AGAAGG7GG7 I IB 8 

__ A __ A -- TA a 3 337T 77 A3 337T3G7TA3 73A73G3A37 33A3AA3A3~ A7A373-37TA 1194 

-GA3777-7 7 377GA7GAA7 733333333" 73A373AAAA A7A3-GGA333 ZZDZ 

233A3A7T37 3A73377TA3 AAAA7A7A3A 3G3A77GG-33 77GG-3-37333 12 Dc 

7A73373A33 73A3A3A737 3333A37A3A 77 3 AAA 77 A 7 33A37A777A 733AA3733A 111! 
3A77333737 73733A3GA3 37-7333333-3 AGAG377G33 GGTAAGG3G3 A733A77777 

773A:AA777 73A3A7AAAA AA337A7A73 73777 7GA373GGA3 AG3AA3A33: 1^.4 

A3 37A3 3A37 3 77A33733A A333 3AAA3GA7T: 3A33A7737' 7773AA733A 

3333373— 3 733A733733 73733ZA33A AG7GGOG3A3 337A77773: 

3G737A73AZ A777A3T33A AAA7A77G3 Z 3333-333373 3A73A737AA 1342: 

A77T3A737A 7A7A33A33A A737373A77 7A73GGG3G3 A3A7A77373 1242 : 

377A73A333 7T3G7T373A 3-3GGG73T77 3 337 773 77: ACGG33rTr3 Z2€Z: 

3A333A7337 373A3A3A33 737T7GAGGG GG7GG7G33A 3A73A3373A 1Z66 : 

33A33A7A3A 77~33A33A3 7T3A333777 3A3A73A3A7 3A73A33A73 33337 3A73' ll": 

73777TA777 T3A3A37373 A37T7TA7A7 7T AA777T GG 3733A3A373 IA33737A73 12" = : 

7GG3-30GGG7 A7GG3373A7 77GA7A7T77 G73337T33A A33A3G3A73 A3G3 773337 ' " ' " ' 



= : 



133c : 
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FIGURE 3A-8 

7737GAA373 7377733373 3A337A3GAA AAAAAG7A77 3777A77733 333A3A773A 13 ri: 

A333733373 37A3A3A333 A3333A73AA A3 A3 3 37773 37GA3G3AG3 3GA77G733A 12 ZZ: 

"A73G3733A AA37777A77 3733737377 333773AAA7 7773A3AA33 37773AG3A7 - ^ - 

3373777777 7377A77377 A77733AAAA 77373737A3 77A3733737 777737377: - - - — 

3377AA7777 7A377G37AA 7733773AA3 3A773A3777 3333A33773 773733 TAT 7 Z2ZZ. 

73A337337G 77373A3A37 3GG373A3A7 G737G3G3G7 7333A73A33 33A77T737G ZZZi . 

7A3A3333A3 3G3GAG3GGA G3Z3TA733A 77A7733A73 33377TA3AA 3337777737 12ZS'- 

A-77A73G77 AA3A3377A7 37GGA33A37 3A3A773737 C 3 3G3A3AAA 773A3A3337 Z 2 4 4 : 

A737337AA3 3A3A773A37 GGGAAAA777 G37G337A73 77T737373A 77A777A733 ZZSZ 

777AGA77A3 AA77373A3C CGGAAG3A77 3777A73333 A3 333 3 A7 77 773A73A373 13 5£: 

73A77A73GA 77A7T3A777 CTCAG737CG GAGG37GG33 7GG377AG33 377333A337 13 63 C 

37337GGGAA 3A7A777A3A A33GGA777A CCTAG3TCGG 377G-3AAA37 373A333373 1268 : 

GG7GAA7GTA 733A33377C C77TGA7TCT AAA7T7TAAA 7AAAGG737G 7 3A 773 377 A 13'4: 

3A33A33A77 AAAAA33A77 CA37GAGA7G 7 AA3 33 77AAGOGA 77A7AC3GOG 138:: 

A~~ 7 AAA;. 7 3 3G33A773A7 A333 7AAGA3773G 3737773-3-3-3 33 .... 3 3A7 13 86: 

73777737TG 7AAA77A7A7 A7AA37TAAA 33AAAA7733 3AGG3AGA3A A3-373A3GG~ ZZrZl 

3373A3AA77 3A377GAGAG 73AGAGAA7A 3A37377AA7 GA3G37A3A7 3A33A73A77 12 5S: 

-737333777 3GA37CAGGG GAG-GAA7GG7 GGA3GG7733 G7377GG7G3 GAA7GG3ZAG 14 04: 

3AAG3GT733 373A773-373 77A7AA3A37 0777773773 77A373A7A3 G3G37TG337 1417: 

77A77G7733 A 773333737 7777GG3GG7 73GA773733 333G33A337 3A37A3-33A3 Zt Zi - 

G333A33G73 G337A77A33 7377T3G3A3 37733GA333 3A33 373G37 3A3A733A33 142:: 

333337373" 3 37A7AA7G3 A73-337A7G3 GGSGGGGG73 7A3ATAA773 A333373737 1434 

77T7A77373 333A7A7GG-3 37737 A3A7 7 73777A73A7 3 337 A3 3 3A 7 AAAA7G3-377 144 7: 

A73A3AA373 3 3A3AA37AA 37737333A7 A7 37 3 37777 3 3773-333 37 37A73A3737 144£: 

37773A7337 7AA33333A3 33A7773A73 333333A337 37777A77A3 37A3-373A37 14 = 1 : 

3G373AAA73 A7A77773A7 G3737733A3 A33A333737 G037333377 3A3S373333 14 5S: 

A773G33A77 33A3G33337 3 77 77737 3 A 37A7A3737A 3AAAA737 A - 3333A33333 14": 

3A7A7T7-337 7AA3GA33G7 733AA73337 A3 33 3AGAA7 3773337377 3773337T33 14": 

A733333337 73A3333777 AA77373373 73GA33A3A3 33A7337377 AA73A73373 14S2: 

~A7 3 33 3A33 A3377A7777 37AA33A377 37333A373A G3773333A3 3AA3AAA737 143=: 

773A73333A A7A7GA3G77 777GG73-GGA G7773A37A3 77373A3AA7 3333333A33 1454: 

33A37A33G3 A333773337 3 77 377 337 A 7A333GSG-37 3GA7G7AAA7 AAA 3 A3 3737 15": 
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FIGURE 3A-9 

rrXA33C A37T3TGAA3 3T3373GG33 373373737A 3333A3A3A7 373AAA7737 I 3 : f : 

"3 3373A33GA3 3A3333G7AA A3337A33A7 773A33A332 737773373: - 

-------3^ 7333333733 A7AAAGAG37 7337337337 77T3A733A7 131=: 

"A733 73A7G7A337 33GAA73733 AG7G7G7AA7 AA33A3A33A 3A37A37333 - I : - ' 

3ZA337 GGAAA73773 733G733733 GGA3A3A3G7 AA373A33A3 A773A33A73 1= •: : 



T 7 77 3-3 3 33773 37 3 A3 37TAA33A3G AAA 37 3 37 33 A3 3 S3 7 37 3A 3: 





37A7773T33 ' 

33337A7333 3333A3A333 333337A337 3A3337T7 33 A3-3AA7733A GA33373373 1€BS 
~-;--;~7A333 333A3333-G3 A373A333G7 SGA33377G3 




_ A __ AT ^ 7; ^ — A7AA377T GG 3 37 7777 3 73AA33A73A AA733A3AA7 A373333A37 - * • 



7AA7AAAA7T 3333733777 A337A3A33A 3 3 77 AAA 3" 373777733- 13-4S: 



: 5 r-; : 

r73TTGG77 77T3GGAA7A AAAGGGGG33 73T373333A 1 = £ : : 
733A7337A7 G3G7GA333A 37GGA73373 GACA737GG7 3AA73A3AAA GA7TT73AG3 1 = ^C 
^..-.--^^ , 7T 3A37 3AA333377A GGGAG3AA37 G3773333GG 37 7 A3 33 3A 3 1 3 " 

- ZZXZZGZ ~- r5G TrT3G37 GA3777C7A7 G7CAGAAAA3 A 3 AAA 3 A 77 7 733773773A 15 78 C 
Tr - A -~- 3GT 33G3A73GA3 73 3777 3 CG A GGG7G7G33G 3G7AGAGA33 GG733GAAA7 
3AA3A37G7G 333A373377 7A7373AG7T CGAGGG337G 7777333737 A33A37G733 
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33A3A33373 AT3TG3GAA3 7AA3GGG7AA 7737A73377 G33AA2AT77 AA3AG-GGG3A 
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_iis : 



15S4: 
1590C 

: 3 96 : 

ie:r: 



3333A3AAAA 73GA73333A 733TA73AA7 3AAA3A7A33 7A33.37.73 7733 
773G333A33 7337377333 7AGGA377T7 3737733A72 A7323A7A23 337GGAG377 

SGAAA33T37 777322A333 GAGGAAAA33 AA77T33377 7T3A37r777 15s-- 

r „ rT 3--- r _ G337333 7773777733 3A33A7GGGA A77773A733 A777GA3A72 -iii : 

73337A7733 33ATTT3373 3327337777 AA7777A37Z 7TA37373A3 A7777777A.7 1 £ " 4 : 

I73337A7 GGGGG33GGA 333AA7AA73 3333GA3723 GA333773A3 If ST 



37333733A3 AAA3777A3A 



1= =1 



WO 96/06159 



PCT/US95/10194 



12/48 



FIGURE 3A-10 




GSCrTTGATT GA^TCAJ^A AGAAAAAAG 

CA3rc«GT3: ArcrrGGGAA cgtaacas" 



; - : = : 
:*:■■;: 
: ~ 4 : : 

: 

: - = : : 

:~56: 
; -c-i : 

:~76 : 
:~bi" 

l^BBC 

leooc 
leoec 
i s 12 : 
1 6 1 e : 
1624: 
i e 3 c : 

1536: 

.54; : 

1B48: 

125.4: 
ibc : : 
less: 
is": : 
1 e-»8 : 
1 = 34: 
issi: 
iesc: 

15-i : 
isi-i : 
isi:: 



THIS PAGE BLANK (uspto, 



WO 96/06159 



PCT/X*S95'ltU t *-l 



13/48 



FIGURE 3A-11 




3773A3AAA3 GA33AAAG37 7AA33AGAAA A33773AA37 73333-37A3A 
3733A3-3GA3 "337373777 TT3T3333G3 77G-G7A7733 A33A3773A7 33-3373733A 
733733333A 3A3A3333A7 77GTAA37A7 377 3 A7 A 3-3-3 A7733AA7A3 
G3G3SG3TT7 G3GGGGAGG7 7G7377333A 33G777G3G3 G77733A337 3333A~7733 
-3A3373033 3733 3333 A7 3C73333737 G3A37A7377 733A3-3A333 AA3737A73A 
___ TATTr - 3 73CAZ3A333 3AA7AAAAGG 3733333A73 337333377- 7333A3A373 
7333373AA7 3TGGOGGCA3 7CAG7TC33A 3777737333 333777TT3-G 3337773773 
G3AAGG3G77 3377G7GA33 3373A33A33 A73TT737T~ 73A3 373337 
G-37GA3333G A3G7GA33A3 7AGC7A3G73 AAG-33337A3 AA 377 37 377 
33A37-7A33 3AGAACA7A3 7T3GACAA7T TAAG7TGA7A GAAA3T3333 73AA3A337- 
377C7T3GT7 7CCAACG7GA 7GC33GAGG7 CCAG7CAAT7 7GGAG7GG33 3-3 3 3 3-3 3 377 
T77AG7AA73 73CACTTG33 7AGA773GA3 AA33733AGA GAG7C7773G 
G3A3G-37773 GGG-G3GGGG-3 G7GAGGAAA7 333A773GA3 333777 3A33 7 A3 AAA TATA 
33AAAA3GG3 3AS3737T77 A3AATCA77A 7377A333A3 GAG7GGA3G7 GGG7777GA3 
7773AA7AA3 3A73333733 77330GAGG3 737A3A7GG3 7737373A33 73G-GAA77TG 
3AA3037TT- 777377 3A33 A333377T73 G77G77ATGG 773 77377 3A A33-3A33333 
„__-- 7GGG33GA77 3773 3 7737 A TAA33A33A3 7G3GC7TA33 33GG3333G7 
377A3-7T3A GG73A3ATS7 A3337333AA A33-3GA7777 77G77G7T33 77AA73A73: 

7A7A3GGA3A 77777 3 3 3G 3 A3A733333-3 33G 3773 333 
GGA7ATAAA3 GG7A37GAG7 3A3A73AAAA 73A7AGA733 
-7333A3A37 7A3A7T7 333 A3A377G377 7T7 G7373A3 A7A7A73A33 AAAA7 A3 3AT 
AA7T33GOGT 3AGGGGA333 A337S3G73-3 AA7377A773 77GA373GAA AAGGGA333A 
37A7A7AA3A 3G3AA73773 AGA333AAA3 37373 3AA 37 A 333-3 3 3A 37 A777AA7GA? 
3 3 3A7337A7 3A3A7A3333 33A73A73A3 3A73A73AA3 3A3AA73-3A3 73AA33AA37 

AGG33AA3A7 3AA3A3GAG-3 33A37737A7 

37AAAA3A3A -= ~- "~ 
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FIGURE 3B 

SEQ. ID. NO. 36 
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FIGURE 3C 

SEQ. ID. NO. 37 




"GG GGGGGGGTGA TGGGAT-.G7 
TTAGTCG3GA GAA3ATA3GG CCCTTGGGAA GCCGGTGAAC CAGCTGGAGG GTGTGGAAGA 
TGCGGAGGGG TTGTGGGAGC TGTCGCGA7A GAGGTTAGGG TAGGT3TCGG GTGCGTCGGT 
3GGGTGAAAG CTGCCCAGAG ACACCACTGT CTGCTGGGGG ATCATGGTTG TGAGGGAGAT 
G G ATT GTTTG GAAGTAGTGG TAGAGATGGA GGAGAGTGCC AG-GGGGTTGC AGGAGTGGTG 
GGGATGGTGG GGAGGGTTTT TAAGAAACGG GGGAGGGTGG GGAGTGGGGG TGGTTGGAGG 
ATGTGGGGGT GGTGTAGGTG CTTGGCGAAT ATGGGAGGAA ATGGGGTGTG GGGAGGGGGT 
GGCAGGGGCG GTGGGGTGGC ATAGAGGCGG GTGAGGGCGG GGTGGGTGTG TCCGGGTGGA 54 
AAGAGGGTGG TGTGAAAGAA CAGGTTGGAA GGGGGGGAAT AGGGGTGTGG AGGGTGGTGT 6C 
GGACGTGGGT GTATGGTCCG TGGATGC 



IS 
"4 

3C 
36 

4 : 

49 



THIS PAGE BLANK (usrto, 



WO 96/06159 



PCT.X'S95.l01 t >-» 



16/48 



FIGURE 3D 

SEQ. ID. NO. 38 
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FIGURE 3E 

SEQ. ID. NO. 39 
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FIGURE 3F 

SEQ. ID. NO. 40 
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FIGURE 22 



PCR anaivsis of KS330233 m DN'A samples fro™ patient: 
with Kaposi's sarcoma and ramor controls 

K5 K533D233 
No. tested positive (°/o) 

K5 tissue: 

AID5-K5 24 12 (92) 

Endemic K5 20 17(53) 

Total 44 39 fS9) 

Control Tumors: 

HIV seropositive 7 1 (14) 

HIV seronegative 15 2 (13) 

Total 22 3(14) 



BLANK 



WO 96/06159 



PCTVUS95/101Q4 



38/48 





T H® PAGE BLANK 



MFTO) 



WO 96/06159 



PCT/US95/10194 




TH/S RAGE BLANK 



(OSPTOJ 



WO 96/06159 



PCTO. : 595'10194 



40/48 




TH§8 PAGE BLANK (uspto) 



WO 96/06159 



PCT/US95/10W 




WO 96/06159 



PCT/VS95/10194 



42/4S 



FIGURE 27A 



CD 

o 

CD 



CD 



- 53 N A 



_.. 5BNA 2 
- E3NA 1 



-D21 



CSV i'-: R^' 



SUBSTITUTE SHEET (RULE 26) 



BLANK 



(USPTO) 



WO 96/06159 



PCTWS95/10194 




SUBSTITUTE SHEET (RULE 26) 



™M BLANK mPT0) 



WO 96/06159 



PCT71S95/1019J 



44/45 



FIGURE 2BA 
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